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Preface 


The purpose of this book is to provide the student beginning 
undergraduate mathematics with a solid foundation in the basic 
logical concepts necessary for most of the subjects encountered in a 
university mathematics course. 

The main distinction between most school mathematics and 
university mathematics lies in the degree of rigour demanded at 
university level. In general, the new student has no experience of 
wholly rigorous definitions and proofs, with the result that, although 
competent to handle quite difficult problems in, say, the differential 
calculus, he/she is totally lost when presented with a rigorous 
definition of limits and derivatives. In effect, this means that in the first 
few weeks at university the student needs to master what is virtually 
an entire new language (‘the language of mathematics’) and to adopt 
an entirely new mode of thinking. Needless to say, only the very ablest 
students come through this process without a great deal of difficulty. 
Unfortunately, whilst it is understood by all university teachers of 
mathematics that it takes a great deal of time and effort to master the 
new crafts, very little class time can be made available to cover the 
necessary material (and indeed, even if it were, it is difficult to see how 
it could be successfully used, since what the student really requires is 
time to ponder over the new ideas: it is not the amount of material 
that causes the problems, rather its strangeness). It is intended that 
this book will be of use to the student during the first term at 
university (and perhaps also during the weeks prior to entering the 
university). 

The material covered represents only a tiny fragment of the typical 
first year mathematics syllabus, and indeed is usually ‘dealt with’ in 
the first two weeks of lectures. But time spent mastering this material 
pays dividends later on, particularly when the student begins the 
Analysis course: always a traumatic event! 
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viii PREFACE 


I have strived to keep the book as short as possible, whilst keeping 
in mind my intention that the weakest student should be able to read 
through it unaided. This has meant omitting many topics, e.g. number 
systems, countability, relations, continuity, which would arguably fall 
within the scope of a book of this nature. However, there are available 
on the market a number of excellent introductory texts in Algebra and 
Analysis which deal with these topics, and in any case they fall quite 
clearly within the syllabuses of the various subjects concerned. So I 
have included only those topics which are basic to almost every area of 
mathematics, and without which it is practically impossible to begin 
anything at all. 

In Chapter 1, we examine those parts of language which are so 
crucial to (pure) mathematics and which traditionally cause a great 
deal of trouble to beginners (in particular, the negation of statements 
involving quantifiers and implications). This is not a crash course in 
mathematical logic in the formal sense — rather just an attempt to 
make precise those points of language which play such a vital role in 
mathematical discourse. It is designed to prepare the student for 
his/her other courses as quickly as possible. Some lectures in 
mathematical logic could subsequently be given to strengthen this 
initial coverage, should the instructor feel this is desirable — but this 
would fall outside the narrow aims of this book. 

Chapter 2 deals with sets and functions, which, though intrinsically 
easy, cause many problems for students who meet this for the first 
time at university level. 

Next, in Chapter 3 we take a brief look at the real numbers. As 
much as anything this chapter provides concrete applications and 
examples of the material covered in the first two chapters. Though 
this will clearly prepare the ground for an Analysis course, it is not 
intended to supersede any of the parts of such a course. In particular, 
convergence of sequences is included purely as an illustration of 
the behaviour of quantifiers. 

Finally, in Chapter 4 we present a crash course in complex 
numbers. Though this is primarily designed for the reader who has 
not covered this topic at school, the earlier parts of this chapter 
should be of interest to all readers, and tie in with chapter 3 to some 
extent. 


Note for the student 


Unless you have done what is known as a ‘modern maths’ syllabus at 
A-level, almost everything in this book will be new to you, and will 
doubtless seem very strange. Indeed, you may feel that it is not 
‘mathematics’ at all. With patience and a fair amount of hard work, 
this stage will soon pass. Do not try to rush through the book in order 
to ‘keep up with the lectures’. Take it steadily, and try to understand 
the new concepts. There is little to learn, but a great deal to 
comprehend! (The actual facts contained in this book could be listed 
on three or four pages of notes.) And try the exercises! They are 
included for a purpose: to aid your understanding. Discuss any 
difficulties which arise with your colleagues or with your tutor. Do 
not give up. Last year’s students managed it. So did the previous 
year’s. So did we all. So will you! 


Further reading 


The following book is suggested for further reading: The Foundations 
of Mathematics by I. Stewart and D. Tall (Oxford University Press, 
1977). 


CHAPTER 1 


Use of language in 
mathematics 


It is probably impossible to formulate a precise definition of 
(contemporary) pure mathematics (except perhaps to say that it is 
what contemporary pure mathematicians do for a living!). But one 
thing is clear, in pure mathematics one is concerned with statements 
about mathematical objects. 

Mathematical objects are things such as: integers, real numbers, sets, 
functions, etc. Examples of mathematical statements are: 


(1) There are infinitely many prime numbers. 

(2) For every real number a, the equation x? +a =0 has a real root. 

(3) \/2 is irrational. 

(4) If p(n) denotes the number of primes less than or equal to the 
natural number n, then as n becomes very large, p(n) approaches 
n/log.(n). 


Not only are we interested in statements of the above kind, we are, 
above all, interested in knowing which statements are true and which 
are false. (The truth or falsity in each case is demonstrated by a proof.) 
For instance, in the above examples, (1), (3), and (4) are true whereas 
(2) is false. 

The truth of (1) is easily proved. We show that if we list the primes in 
increasing order as 


Py, D2 P3>-++>Pw-++> 


then the list must continue for ever. (We all know what the first 
few members of the sequence are: p,;=2, p, =3, p3=5,p,=7, 
ps; =11,....) Consider the list up to stage n: 


P1> Pa P3>- - ->Dn 
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Let 


P=(P; P2 P3--- Pr +1 


If p is not a prime, there must be a prime q < p such that q divides p. 
But none of p,,..., p, divides p, for the division of p by any one of 
these leaves a remainder of 1. So, either p must itself be prime, or else 
there is a prime q < p which exceeds p,. Either way we see that there is 
a prime greater than p,. Since this argument does not depend in any 
way upon the size of n, it follows that there are infinitely many primes. 

Example (2) can easily be proved to be false. Since the square of no 
real number is negative, the equation x? + 1 = 0 does not have a real 
root. 

We give a proof of (3) later. The only known proofs of example (4) 
are extremely complicated. 

Before we can prove whether certain statements are true or false, we 
must be able to understand precisely what the statement says. Above 
all, mathematics is a very precise subject, where exactness of 
expression is required. This already creates a difficulty, for in real life 
our use of language is rarely precise. Now, to systematically make the 
entire English language precise (by defining exactly what each word is 
to mean) would be an impossible task. It would also be unnecessary. It 
turns out that by deciding exactly what we mean by a few simple 
words, we can obtain a ‘language’ which is precise. The point is that in 
mathematics we don’t use all of the English language. Indeed, when 
we restrict our attention to mathematical statements (rather than our 
attempts to explain them), we need only a very small part of our 
language. 


1.1 The language of mathematics: part 1 


We shall make precise a few key words of the English language, 
thereby enabling us to formulate in an exact and unambiguous 
fashion any mathematical concept we wish to consider. The difficulty 
for the beginner is that the mathematical usage of some of these words 
is not quite the same as the everyday, non-mathematical usage. 
However, there is no such problem with the first word considered 
below: and. 

We need to be able to assert that two events simultaneously hold. 
For instance, we may wish to say that 7 is greater than 3 and less than 
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3-2. So the word 


and 


is indispensable. Sometimes, in order to shorten an expression, we 
introduce an abbreviation for and. The most common are 


AN , & 


Thus, the expression 
(x > 3) A (nm < 3-2) 


says: 7 is greater than 3 and less than 3-2. In other words, 2 lies 
between 3 and 3:2. 

There is no possible source of confusion when we use the word and. 
(If @ and w are any two mathematical statements, @ A W is the 
assertion (which may not be a valid assertion) that both @ and w are 
true.) The same cannot be said of our next ‘word’. We wish to be able 
to assert that event A occurs or event B occurs. For instance, we might 
want to say 


either the equation x? +a =0 has a real root 
or a>0O 
or perhaps we want to say 
ab=0 if a=0 or b=0 


Now, the use of ‘or’ is different in these two examples. In the first case 
we Say ‘either... or ...., which emphasizes that there is no possibility 
of both occurring (at once). In the second case, it is quite possible for 
bothaand b to be zero. Even if we were to alter our second example by 
inserting an ‘either’, we would still read it as if both possibilities could 
occur at once: the use of ‘either’ only helps to strengthen an assertion 
where it is already clear that there is no possibility of both. But there is 
no harm in dropping the use of the word ‘either’ altogether. The first 
example above remains true without the word ‘either’, even if by ‘or’ 
we understand the possibility that both may occur. True, with this 


4 SETS, FUNCTIONS AND LOGIC 


example both cannot occur. But that does not affect the truth of the 
assertion. 

Accordingly, when we use the word ‘or’ in mathematics we always 
mean the ‘inclusive-or’. If ¢ and Wy are mathematical statements, @ or 
W is the assertion that at least one of $ or is valid. We usually 
abbreviate or by the symbol v . Thus ¢ v means at least one of ¢ 
and w are valid. 

For instance, the following statement is true: 


(3 <5) v (1 =0) 


(Beginners often make the mistake of trying to argue that this is ‘false’ 
because | cannot equal 0. They fail to grasp that all that is happening 
is that we decide on the convention that ‘or’ has the meaning ‘at least 
one of’. This does not agree with many uses of ‘or’ outside of 
mathematics, but for mathematics this definition chosen has many 
advantages.) 

For our next word, we need to be able to negate a statement, to say 
that a particular statement is false. So we need the word 


not 


If ¢ is any statement, not-¢ is the assertion that @ is false. Thus, if ¢ is a 
true statement, not-¢ is a false statement; and if ¢ is a false statement, 
not-@ is a true statement. We usually abbreviate not by the symbol 


— 


(Older texts sometimes use the symbol ~ but this has long since 
dropped from common usage.) 

Now, although our usage of not accords with most common usage, 
the negation concept is sometimes used very loosely in everyday 
speech. For instance, there is no confusion about the meaning of the 
statement 


— (x < 3) 
This clearly means 


n23 


(which, incidentally, is the same as (z = 3) v (a > 3)!). But consider the 
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statement 
All British cars are badly made 


What is the negation of this statement? 


(a) All British cars are well made. 

(b) All British cars are not badly made. 

(c) At least one British car is well made. 

(d) At least one British car is not badly made. 


A common mistake is for the beginner to choose (a). But this is 
obviously wrong. Our original statement is false (consider Rolls 
Royce). Hence the negation of this statement will be true. But (a) is 
certainly not true! Neither is (b) true. So realistic considerations lead 
us to conclude that the correct answer is either (c) or (d). (We shall 
later see how we can eliminate (a) and (b) by a mathematical argument.) 
Both (c) and (d) can be said to represent the negation of our original 
statement. (Rolls Royce testifies the truth of both (c) and (d).) Which 
do you think most closely corresponds to the negation of our original 
statement? (We come back to this example later, but before we leave it 
now, let us remark that the original statement is only concerned with 
British cars. Hence its negation will only deal with British cars. So the 
negation will not contain any reference to foreign cars. For instance, 
the statement 


All foreign cars are well made 


is not the negation of our original statement. Indeed, knowing 
whether our original statement is true or false in no way helps us to 
decide the truth or falsity of the above statement. True, ‘foreign’ is the 
negation of ‘British’ (in Britain, anyway), but we are trying to negate 
the assertion as a whole, not some adjective occurring in it.) 

So far we have considered three basic words and made their 
meaning precise: and (A, &), or (v), not (—). We refer to these 
operations as conjunction, disjunction, and negation, respectively. 
Thus ¢ A wW is the conjunction of the statements ¢ and W, — @ is the 
negation of @, etc. 

Our fourth word causes more initial confusion than all three of 
these together. We need the word ‘implies’. We want to be able to say 
that statement ¢ implies statement w. Indeed implication is the way we 
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prove statements. We shall write 
oy 


to mean that ¢ implies w. (Modern texts use a single arrow, —, instead 
of =, but to avoid confusion with our function notation later we 
adopt the more old-fashioned notation here.) 

Now, we all know what ¢=w means. Or do we? Well, it certainly 
means 


if @ is true, then w has to be true as well 


Thus, if @ is a true assertion, then ¢=>w will be a true assertion 
providing wy is true. But wait; suppose for ¢ I take the true assertion 
De 2 is irrational’ and for I take the true assertion ‘0 < 1’. Then is the 
implication ¢=w true? In other words, does the irrationality of ,/ 2 
imply that 0 is less than 1? Of course it does not. Again, what can be 
said about the truth of the implication ¢= y if ¢ is false? A confusing 
state of affairs indeed. It would appear that @=>w is often mean- 
ingless. But we cannot have meaningless statements floating about. 
The truth or falsity of the implication ¢= wy should depend upon the 
nature of ¢ and y, but not the meaningfulness of this implication. And 
if we leave certain cases without a meaning we soon find ourselves in 
an appalling mess. For instance, it is possible to construct a perfectly 
logical argument which does deduce the fact that 0<1 from the 
assumption that,/ 2 is irrational. (/ 2 being irrational will clearly play 
a rather redundant role in such an argument, but so what?) By now, 
the reader is no doubt thoroughly confused — which is good. Because 
now he should be ready to accept the way out of the dilemma. We 
shall make precise what is meant by the implication ¢= w being ‘true’ 
or ‘false’. In all cases where the implication @=> W is ‘meaningful’, our 
definition will agree with the ‘usual idea’. But we shall eliminate all 
possible confusions (in a strict sense, though you are forgiven if you 
still feel a little bewildered by all this) by assigning (in a sensible 
manner) a meaning to all eventualities. 

The way we arrive at our definition is to consider not the notion of 
implication but its negation. Let us try to assign a precise meaning to 
the statement ‘¢ does not imply W’, which we will write as 


ory 
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(though we could use — (6=w) instead.) Well, leaving aside all 
question of whether there is a ‘meaningful’ causal relation between @ 
and w (to avoid uh 2 and 0 < 1 type difficulties), how can we be sure 
that ¢ + isa valid statement? More precisely, how should the truth 
or falsity of the assertion ¢=+ W depend upon the truth or falsity of @ 
and W? Well, ¢ will not imply w ifit is the case that although @ is true, 
is nevertheless false. Please read this last sentence again. Now once 
more. O.K., now we are ready to continue. (On second thoughts, 
maybe you should read it a fourth time.) By letting the truth of 
o@=+w depend only upon the truth or falsity of @ and W, we have re- 
moved all uncertainties about whether there is really a ‘sensible’ 
relation between @ and w. And our notion of 6+w certainly agrees 
with all special cases where we do have a ‘meaningful’ relationship. 

Having defined the truth or falsity of 6+ we obtain that of d=>w 
by just taking the negation. => w will be true exactly when 6+ wy is 
false. Examination of this definition leads to the following 
conclusions: 

o=w will be true whenever one of the following holds: 


(1) @ and w are both true. 
(2) @ and w are both false. 
(3) @ is false and w is true. 


The points to note are: 


(a) We are defining what ‘implies’ should mean. 

(b) To avoid difficulties, we base our definition solely on the notion of 
truth and falsity. 

(c) Our definition agrees with our intuition in all ‘meaningful’ cases. 


Closely related to implication is the notion of ‘equivalence’. Two 
statements ¢ and w are said to be equivalent, written 6 (or Pow 
in modern texts), if d= W and y= @. By looking back at the definition 
of implication, this means that ¢ and w are equivalent if they are both 
true or both false. The three remarks (a), (b), (c) above apply to 
equivalence just as to implication. 


Exercise 1.1 


(1) Express in a concise manner the meaning of the following 
statements: 
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(a) (x>0) A(x < 10) 
(b) (3<4)A (3 <6) 
(c) (e<4) A (e? <9) 
(d) (x>0)v (a> 1) 
(e) (x<0)v (x>0) 
(2) Which of the following are true and which are false? 


(a) (x? >2)(n> 1-4) 

(b) (x? <0)>(x=3) 

(c) (x? >0)>(14+2=4) 

(d) (x<12?)>(x=5) 

(e) (e? 20)>(e <0) 

(f) —[(5 is an integer)=>(5? > 1)] 

(g) (the area of a circle of radius 1 is z)=>(3 is a prime) 


1.2 Properties of the language 


So far we have taken five common concepts, and, or, not, implies, 
equivalence, and made their meanings precise. Thus, whenever 
precision is necessary, we can use these words with absolute 
conviction: there should be no possible cause for confusion (except 
due to misunderstanding of the concepts, which can only be overcome 
by experience on the part of the student). Of course, our precise 
‘language of mathematics’ will include other ‘symbols’ or ‘words’ such 
as <, =, less than, z, e, function, etc., but these all have a precise 
definition already (since they are mathematical concepts). It is only 
those parts of the ‘language’ which we take from common usage which 
need clarification and eventual definition. There are two further 
concepts which we take from everyday speech, but we shall postpone 
those until later. In the meantime we consider the language developed 
so far. That is, we investigate the consequences of our formal 
definitions, and also introduce some common terminology. 

We consider first the behaviour of negation. What is the meaning of 


(ory) 


where ¢@ and & are some mathematical statements? 
Literally, @ A w means ‘both ¢ and w are true’, so —(¢@ A W) means 
‘jt is not the case that both ¢ and w are true’. Well, if they are not both 
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true, then at least one of them must be false. But to say ‘at least one of @ 
and w is false’, is clearly the same as saying ‘at least one of — @ and 
— w is true’ (by our definition of negation). By our meaning for ‘or’, 
this can be expressed as (~ ¢) v (~ W). Thus 


(paw) and (~4)v(—yW) 


are equivalent (i.e. mean the same). 
Likewise it can be shown (exercise: carry out the required logical 
argument) that 


—(@vy) and (~4)r(-yW) 


are equivalent. 

Thus, negation has the effect that it changes v into A and 
changes A into v . Now re-read the above ‘proof’ that —(@ A W) and 
(— @) v (—W) are the same. 


(Question: what does —(— ¢) say?) 


The effect of negation on implication is already bound up with the 
definition of implication. 
By our definition 


oy 
is the same as 
(—o)v 
And 
(o>) 
is the same as 
bA(—W) 


Of course, once we have made the meanings of various words precise, 
and introduced abbreviations, we are free to alter our notation if it 
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seems helpful. Thus, we usually write 


ory 
instead of 
(>) 
Likewise, we use the more familiar 
x#y 
instead of 
> (x= y) 
Another example: in dealing with real numbers, we usually write, say, 
a<x<b 
instead of 
(a< x) A(x Sb) 
But we would probably write 
—(a<x<b) 
instead of 
at¢x€b 
as the latter would appear to be rather unclear as to its meaning. (We 
could make this precise, but it still seems rather inelegant, and as such 
is best avoided.) 
There is some terminology associated with => and <> which should 


be mastered straight away, as it pervades all mathematical discussion. 
We can read ¢>w as ‘w holds if @ holds’, i.e. 


vy if o 
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Likewise => @ can be read as 


> if w 
The following way of reading 6<>W follows now from the definition of 
oy as (p>) A (w=): 
Read yw as 


> iff w 
(i.e. if A fi, reversing the direction.) In fact it is more common to write 


> if 


(dropping the second i). Of course, it is rather difficult in giving a 
lecture to distinguish between ‘if? and ‘iff. But we do not read ‘iff? 
aloud as a prolonged ‘if’. We make the following observation. We can 
read 6<>W as 


W if and only if @ 


The ‘if’ comes from the ¢= y, the ‘only if? coming (in the presence of 
the ‘if’) from the y > ¢. Of course, when we have an equivalence, the 
order does not matter, so we could also write (or say) 


@ if and only if w 
This is how we read ‘iff’. So: 


(a) We write ‘iff’. 
(b) We read it as ‘if and only if’. 
(c) We mean ‘they are equivalent’. 


Now, the use of ‘iff’ causes no problems once the student knows it 
means ‘equivalent’ (or ‘implication both ways’). (The usage of ‘w if ¢” 
to mean ‘¢ implies y’ is so rare that it can be ignored just now.) This 
cannot be said of our next pair of words. 

Consider the implication ¢=>w. What this says is that for to be 
true it suffices that ¢ be true. Or, @ is a sufficient condition for w. It also 
says that in order for ¢ to be true, it is necessary that w be true (though 
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perhaps not sufficient!) So: is a necessary condition for @. The words 
‘necessary’ and ‘sufficient’ in this context are very widespread, which 
is a pity because in our experience everyone (including the experts) 
has to stop and think carefully which is which. Here it is in a diagram 


¢o => Ww 


| 


sufficient necessary 


(Think of the word ‘sun’. This will remind you of the order.) So, if you 
must show that ¢ is a necessary condition for y you must prove p => @. 
And to show that @ is a sufficient condition for Wy you must prove ¢ > 
w. Still worried? Fortunately, you usually need to show that w is both 
necessary and sufficient for @.(To put it another way, mathematicians, 
afraid of showing themselves up by getting things the wrong way 
round, usually only use these words together: in which case the 
meaning is the same as ‘iff.) So, if you need to show that w is necessary 
and sufficient for ¢, what is required is a proof of the implication both 
ways. (But beware the sadistic setter of examination questions! Be 
prepared!) 

And so to some light relief to finish the section. Since we have 
defined all of our logical connectives (A, Vv , —, =>, <>) by reference to 
truth and falsity alone, and not to meaning, it is possible to represent 
(or illustrate) them by means of a table: a truth table. We introduce 
two symbols: T, to denote ‘true’ and F to denote ‘false’. The definition 
of @ A W can be illustrated by the table: 


¢ wv oar 


a aHy4 
4 


In the first two columns appear all the possible combinations of values 
of T and F which the two statements ¢ and w can have. In the third 
column we see the value ¢ “ w achieves according to each assignment 
of T or F to ¢ and w. Thus, we see that @ A W is T only when both @ 
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and w are T. Similarly, the definition of — @ can be represented thus: 


b —¢ 
T F 
F iT 


One can go on to construct truth tables for more complicated 
expressions. Consider, for example, the expression (¢ A W) v(— @). 
We can build its table column by column as follows: 


¢ yp orp —b (PrAWV(—-4) 


a74H44 
~Hm4 
4 
4H 
Hynes 


We can also draw up tables for expressions such as (¢ A W) v 0, but if 
there are n ‘primitive statements’ involved there will be 2” rows in the 
table, so already (f 4 W) v @ needs 8 rows! 

Truth tables can be useful in checking that two rather complex 
statements are equivalent. For (by our definition of equivalence) two 
statements will be equivalent if they have the same truth table. For 
example, we can demonstrate the equivalence of 


—=(¢AW) and (~¢)v(—y) 


as follows: 
* * 
p w praw =(¢rv) =~ ww (—o)v(-W) 
T T T F F F F 
T F F T F T T 
F T F T T F T 
F F F T T T T 


The two columns marked * being identical shows that the two 
expressions are equivalent. 
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Exercise 1.2 


(1) Which of the following conditions is necessary for the natural 
number n to be divisible by 6? 


(a) nis divisible by 3 

(b) nis divisible by 9 

(c) nis divisible by 12 

(d) n=24 

(e) n? is divisible by 3 

(f) nis even and divisible by 3 


(2) In (1), which conditions are sufficient for n to be divisible by 6? 

(3) In (1), which conditions are necessary and sufficient for n to be 
divisible by 6? 

(4) Let m,n denote any two natural numbers. Prove that mn is odd iff 
m and n are odd. 

(5) With reference to (4), is it true that mn is even iffm and nare even? 

(6) Show that 6<w is equivalent to (= ¢)<>(— w). How does this 
relate to your answers to questions (4) and (5) above? 

(7) Draw truth tables to illustrate the following: (a) ¢ v W:(b)¢>y; 
(c) oy. 


(8) Use truth tables to prove that the following are equivalent: 


(a) —(¢=>¥) and 6 A(—y) 
(b) p=>(W A 8) and (p>) A (o=>8) 
(c) (Pv b)=0 and (p +0) a (W +8) 


(9) Verify the equivalences in (b) and (c) above by means of a logical 
argument. (That is, in (b) say, assuming ¢ and concluding wy A 6 
is the same as both deducing W from ¢ and @ from ¢.) 

(10) Use truth tables to prove the equivalence of 


o=y and (—)>(—¢) 


(11) Use truth tables to show that ¢=wW and w=>¢ are not 
equivalent. Give an example of statements @ and w such that 
o=vw is true but y= ¢@ is false. 

(12) Connected with question (11), which of the following are 
equivalent? 


(a) PAW and Wad 
(6) dvw and Wve 
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1.3 The language of mathematics: part 2 
And so to the final part of our language. We introduce the two 
quantifiers. In mathematics we frequently make existence assertions, 
e.g. 

the equation x* + 2x + 1 =0 has a real root 
or, to put it another way, 
there exists a real number a such that a? + 2a+1=0 

or, 


“2 is rational 


which does not look much like an existence statement until we write it 
in the form 


there exist natural numbers p and q such that Ni 2=p/q 
The first example here is a true existence statement (take a = — 1); the 
second is false (as we prove later). (Remark: the second example is not 
unique; there are many mathematical statements which are really 
existence assertions, but which do not appear so on the surface.) 


We use the symbol 


dx 


to mean 
there exists an x such that 
or, if we want to specify that kind of x, we write for instance 


(Axe) 


to mean 
there exists a natural number x such that 


(Here VY denotes the set of natural numbers, and ‘€.VW” means ‘is an 
element of ’. See later for a discussion of basic set theory.) 
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For instance, to say that the equation x? + 2x + 1 =0 has a real 
root we would write 


(4xe&)(x? + 2x + 1=0) 
(Z denotes the set of real numbers.) 


Notice that our symbol is just a back-to-front E, which denotes 
Exists, of course. 

As well as saying that certain objects exist, we also need to say that 
something holds for all x. We use the symbol 


Vx 
to mean 
for all x it is the case that... 


And if we wish to specify what sort of x we are considering, we write, 
for instance 


(VxEN) 
to mean 
for all natural numbers x it is the case that... 
For instance, to say that ah 2 is irrational we can write 
(WpeN)(VgeN \\/2 + p/a) 


To avoid proliferation of notation, we would generally abbreviate this 
to 


(Vp, qe NV )h/2 # p/a) 


Notice that V is just an upside-down A, coming from All. 


Exercise 1.3 


(1) Express the following as existence assertions: 
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(a) The equation x3 = 27 has a natural number solution. 
(b) 1000000 is not the largest natural number. 
(c) the natural number n is not a prime. 


(2) Express the following as ‘for all’ assertions: 


(a) The equation x? =28 does not have a natural number 
solution. 

(b) O is less than every natural number. 

(c) the natural number n is prime. 


The symbol 4 is called the existential quantifier; V is the universal 
quantifier. Most statements in mathematics involve combinations of 
both quantifiers. For instance, to make the assertion that there is no 
largest natural number needs two quantifiers, thus: 


(VmeN )(AneN )(n > m) 


This reads: for all natural numbers m it is the case that there exists a 
natural number n such that n is greater than m. 

Notice that the order in which the quantifiers appear is of 
paramount importance. If we switch the order in the above we get 


(dneNV )(VmeN )(n > m) 


This asserts that there is a natural number which exceeds al! natural 
numbers, which is clearly false! 


Exercise 1.4 
(1) Express the following using quantifiers: 


(a) The equationx? +a =Ohasareal root for any real number a. 

(b) The equation x? +a =0 has a real root for any negative real 
number a. 

(c) Every real number is rational. 

(d) There is an irrational number. 

(e) There are infinitely many irrational numbers. (This one 
looks quite complex.) 


(2) Let C be the set of all cars, let B(x) mean that x is British, and let 
M(x) mean that x is badly made. 
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Express the following in symbolic form: 


(a) All British cars are badly made. 

(b) All foreign cars are badly made. 

(c) All badly made cars are British. 

(d) There is a British car which is not badly made. 
(e) There is a foreign car which is badly made. 


Particularly in Analysis it is important to be able to negate 
statements involving quantifiers (and end up with the correct answer). 
Of course, one can negate any statement by simply putting a negation 
symbol in front. But what we want is a positive assertion, not a negative 
one. What is meant by ‘positive’ here should become clear from our 
examples. (Roughly speaking, a ‘positive’ statement is one containing 
no negation symbol, or else one in which the negation symbols are as 
far inside the statement as possible, without resulting in a cumber- 
some expression.) 

Let A(x) denote some property of x (e.g. A(x) could say that x isa 
real root of the equation x? + 2x + 1 =0). Suppose it is not the case 
that Vx A(x) is true. That is, assume 


= [Vx A(x)] 


Then, if it is not the case that all x satisfy A(x), what must happen is 
that at least one of the x must fail to satisfy A(x). In other words, for at 
least one x, —~ A(x) must be true. In symbols, this is 


dx[— A(x)] 
Hence — [Vx A(x)] implies 3x [— A(x)]. Now suppose 
Ax[— A(x)] 


Thus there will be an x for which A(x) fails. Hence A(x) does not hold 
for every x (it fails for the x where it fails!) In other words, it is false that 
A(x) holds for all x. In symbols, 


> [Vx A(x)] 


Thus 4x[— A(x)] implies — [Vx A(x)]. Our two implications com- 
bine now to produce the equivalence 


—[Vx A(x)] = I4x[— A(x)] 
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A similar argument shows 
— [3x A(x)] < ¥x[— A()] 
(Exercise: Provide this argument.) 


Now let us return to our original problem about the British cars. 
We wish to negate the statement 


All British cars are badly made 


Let us formulate this symbolically using the notation of 
Exercise 1.4(2). If you got part (a) of this question correct, you should 
have the formulation 


(VxeEC)[B(x)> M(x)] 
Negating this gives 
(dxeC)— [B(x)=> M(x)] 
(One common cause of confusion. Why do we not say (4x¢C)? The 
answer is that the ‘eC’ part simply tells us which kind of x we are to 
consider. Since our original statement concerns British cars, so will its 
negation.) 
Consider now the part 

— [B(x) > M(x)] 
We have seen already that this is equivalent to 

B(x) A[> M(x)] 
Hence as our negated statement (in positive form now) we get 


(axe C)(B(x) A [> M(x)]) 


In words, there is a car which is British and is not badly made; i.e. 
there is a British car which is not badly made. 

We can also obtain this result directly as follows. Ifit is not the case 
that all British cars are badly made, then it must be the case that at 
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least one of them fails to be badly made. Hence, as this argument 
reverses, the required negation is that at least one British car is not 
badly made. 

In order to negate statements with more than one quantifier, the 
idea is to handle each quantifier in turn: the effect being that the 
negation symbol jumps inwards, changing V tod and J to V as it jumps 
over. Thus, for example 


—[Vxdy Vz A(x, y, z)]<4dx — [Fy Vz A(x,y, 2) ] 
<dx Vy [Vz A(x, y, 2)] 
<dx Vy dz— A(x, y, 2) 


Exercise 1.5 


(1) Negate the following statements and put your answer into 

positive form: 

(a) All students are communists. 

(b) One of my friends does not have a car. 

(c) Some elephants do not like currant buns. 

(d) Every triangle is isosceles. 

(e) Some of the students in the class are not here today. 

(f) All of the spark plugs in my car are working correctly. 

(g) (VxeV)ByeN x ty=1) 

(h) (Vx > O)(Ay < O)(x+y=0) (x, y denote real number 
variables). 

(i) Ax(Ve > 0)(\x| < ¢) (see Chapter 2, Section 2.3 for a definition 
of |x|). 

(j) (Vx, yeN )(AzeN)(x + y = 27). 


1.4 Proofs in mathematics 


A proof in mathematics is a logically sound argument which 
establishes the truth of the statement in question. There are many 
types of ‘proofs’. Our purpose here is just to mention a few of these 
and see how they relate to our discussion of language. 

Suppose first we wish to establish the truth of an assertion 


o> 


Well, since this will be true whenever ¢ is false, by our definition, we 
need only consider the case when @ is true. That is we can assume @. 
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For the implication to be valid now W must also be true. Thus, using 
our assumption that @ is true we must now present an argument 
which demonstrates the truth of yw. This of course accords with our 
everyday understanding of ‘implication’. So, when we come to prove 
implications, there is no problem. 

For example, let us prove the statement 


(x and y are rational numbers)=(x + y is a rational number) 


Assume x and y are rational numbers. Then we can find integers 
P, q, m,n such that x = p/m, y =q/n. Then 


pn+qm 


xty=247— 
m on mn 


Hence, as pn + qm and mn are integers, we conclude that x + y is a 
rational. The statement is proved. 


Implications involving quantifiers are often best handled by using 
the equivalence of ¢ > w with (— )=>(— ¢). Suppose, for instance, we 
wish to prove the implication 


(sin 0 # 0)=>(VneV)(0 # nz) 
This statement is equivalent to 
[— (VneV)(0 # nz)] => — (sin 6 #0) 
which reduces to the positive form 
(dne.V )(0 = nx)=> (sin 8 = 0) 


which is an implication we know to be correct. This proves the 
original implication by virtue of what equivalence means: in order to 
prove a statement it is enough to prove any equivalent statement. 


Another common method of proof is the so-called method of proof 
by contradiction (reductio ad absurdum). This is essentially a disguised 
version of the last type of proof. We wish to prove some statement @ 
but cannot see how to begin. So we assume — ¢ and proceed to deduce 
some obviously false statement (e.g. 0 = 1). Since our conclusion is 
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false it follows (assuming our reasoning is sound) that our initial 
assumption of — @ has to be false. Hence @ is shown to be true. This 
relates to the above method of ‘proving an equivalent statement’ as 
follows. Let F denote any false statement, T any true statement. We 
assume — @ and deduce F. So we prove the implication 


(+ ¢)=>F 
But this is logically equivalent to 
(—F)=[-(-9)] 
Le. to 
T+ 
Hence our proof also establishes the implication T= @. But T is true, 
so the truth of the implication means that @ is true, as required. 


A good illustration of the method of proof by contradiction is 
furnished by the following result, promised earlier. 


Theorem 1.1 
ve 2 is irrational. 
Proof 


Assume, on the contrary, that ,/ 2 were rational. Then we can find 
natural numbers p and q such that 


/2= pia 


We may assume that any common factors in p, g have been cancelled 
out. Squaring gives 
2 = p*/q? 


Thus 
p? =2q? (1.1) 


Thus p” is even. Hence p must be even, since (odd)? = (odd). Thus 
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there is a natural number r such that p = 2r. Substituting in Equation 
(1.1) gives 


4r? = 2q? 
Cancelling 2 gives 
2r? =4q 


Thus q? is even. Hence q must be even. But p is even and p and q have 
no common factors, so we have a contradiction. Hence our original 
assumption that ,/ 2 was rational must be false. In other words, ,/ 2 
must be irrational. QED. 


Proving existence assertions is often straightforward. In order to 
prove the statement 


dx A(x) 


It is enough to find one x which satisfies A(x). For instance, if we wish 
to prove that 


there exists an irrational number 


it suffices to quote the above theorem. Not only does this tell us that 
an irrational number exists, it provides us with a specific example. 
This is not always the case. There are many instances in mathematics 
where one knows that, say, a real number exists which satisfies a 
certain property, but one has no idea what such an x looks like, or 
even whether it is positive or negative. Indeed, advanced mathematics 
abounds with examples. We content ourselves here with a trivial one. 
Let A be the statement that in the decimal expansion of x the digit 3 
appears infinitely many times. Consider the existence assertion 


dn[A > n=0)A(—A =>n=1)] 


We can prove this as follows. Either A or — A. If A, then n = 0 suffices. 
If — A, then n = 1 suffices. Hence, as 0 and 1 certainly exist, we have 
proved the existence statement. But at the present state of our 
knowledge we do not know whether or not A is true, so we cannot say 
which of 0 or 1 satisfies the statement. (Clearly, they cannot both 
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satisfy it!) It is possible that one day a proof of A (or of — A) will be 
found, in which case we will have our example. But a proof of A or of 
— A would doubtless be rather deep, and certainly much less trivial 
than our above existence proof. 


Finally, how does one prove a statement of the form 
Vx A(x)? 


One possibility is to take an ‘arbitrary’ x and show that it must satisfy 
A(x). For instance, suppose we wish to prove the assertion 


(VneV)(Ame.V)(m > n?) 


We can do this as follows. Let n be an arbitrary natural number. Then 
n? is a natural number. Hence m =n? + 1 is a natural number. But 
m > n?, so we have shown that 


(Ame.V)(m > n?) 


This is a proof because our original n was quite arbitrary: we said 
nothing at all about n: it could be any natural number: hence the 
argument is valid for all ne VY. This is not the same as picking a 
particular n. If we had chosen, say, n = 37, the proof would not have 
been valid - even though we had chosen 37 quite at random. For 
instance, suppose we wanted to prove 


(Vne.V)(n? = 81) 


By picking at random a particular n we might be unlucky enough to 
pick n = 9. But this does not prove the statement of course, because 
our choice was an arbitrary choice (albeit an unlucky one) of a 
particular n, and not a ‘choice’ of an arbitrary n. When we say ‘let n be 
arbitrary’ we use the symbol n throughout, and assume also that the 
‘value’ of n remains constant throughout, but we make absolutely no 
restriction on what the value of n is. 


There are other possibilities. Statements of the form 


(VneEV)A(n) 


USE OF LANGUAGE IN MATHEMATICS 25 


are often proved by induction. The idea here is to first prove A(1), and 
then prove the statement 


(Vne.V)[A(n)=> A(n+ 1)] 
That this proves (Vne./) A(n) may be reasoned as follows. We proved 
A(1). But we know that A(1)= A(2) is true. Hence A(2) is true. But 
A(2)= A(3) is true. Hence A(3) is true. And so on right through the 


natural numbers. As an example let us use the method of induction to 
prove a simple theorem. 


Theorem 1.2 
If x > 0 then for any ne, 
(L+x)"*!>14(n+1)x 


Proof 
For n=1 the binomial theorem gives 


(1+x)"*t=(14+x)?=14+2x+x?>14+2x=14+(n+1)x 


(since x? > 0). Hence the statement is true for n = 1. We now wish to 
prove the implication 


(VneV)[A(n)=> A(n+ 1)] 
where A(n) is the statement 
(1+x)"*!>14(n4+1)x 


To do this we take an arbitrary (!) n in W. We must prove the 
implication 


A(n)= A(n + 1) 
To do this we assume A(n) and try to deduce A(n+ 1). We have 


(1+x)"*2?=(14+x)"* (1+) 
>(1+(n+1)x)1+x) [by A(n)] 
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=1+(n+l)x+x+(n+ Lx? 
=1+(n+2)x +(n + 1)x? 
>1+(n+2)x 


This proves A(n + 1). (Question: The assumption x > 0 was used in the 
above argument. Where?) Hence by induction the theorem is proved. 


The following summarizes the method of proof by induction. You 
wish to prove that some statement A(n) is valid for all natural 
numbers n. First you establish A(1). This is usually just a matter of 
trivial observation. You then present an algebraic argument which 
establishes the implication 


A(n)=> A(n+ 1) 


for any n. In general, you do this as follows. Assume A(n). Look at the 
statement of A(n + 1), and somehow try to reduce it to A(n), which has 
been assumed true, and thereby deduce the truth of A(n+ 1). This 
having been accomplished, the induction proof is complete. 

In setting out an induction proof formally, three points should be 
remembered: 


(1) State clearly that the method of induction is being used. 

(2) Prove the case n=1 (or at the very least make the written 
observation that this case is obviously true). 

(3) (the hard part) Prove the implication 


A(n)=>A(n+ 1) 


Let us look at one further example to illustrate the above points. We 
prove the following well known result from algebra. 


Theorem 1.3 
Every natural number greater than | is a product of primes. 


(Remark: At first it might seem that the statement to be proved by 
induction is 


A(n):n is a product of primes 


However, as will shortly become clear, it is more convenient to prove 
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by induction the statement 


B(n): every natural number m such that 1<m<n 
is a product of primes 


So here goes; and from now on we shall set out our proof in a correct 
fashion.) 


Proof 
We prove, by induction, that B(n)is true for all natural numbers n > 1, 
where B(n) is the statement 


all natural numbers m such that 1<m<n 
are products of primes 


This clearly proves the theorem. 

For n = 2, the result is trivial: B(2) holds because 2 is prime. (Notice 
that in this case we must start at n = 2 rather than the more common 
n= 1, for obvious reasons.) 

Now assume B(n). We prove B(n + 1). Let m be a natural number 
such that 1<m<n+1. If mn, then by B(n), m is a product of 
primes. So in order to prove B(n + 1) we need only show thatn+ lisa 
product of primes. If n + 1 is a prime there is nothing further to say. 
Otherwise, there are natural numbers p, q, such that 


l<p,q<n+l 
and 
n+1l=pq 
Now p, g <n, so by B(n), p and q are products of primes. But then 
n+1=pq is a product of primes. This completes the proof of 
B(n + 1). 


The theorem is thus proved by induction. 
(End of proof.) 


Of course, in the above example, the implication 


B(n)=> B(n + 1) 


28 SETS, FUNCTIONS AND LOGIC 


was rather easy to establish. (Indeed, we considered B(n) rather than 
the more ‘obvious’ A(n) mentioned earlier precisely in order to carry 
through this simple argument.) In many cases, real ingenuity is 
required. But do not be misled into confusing the proof by induction 
of the main result, (Vne.W)A(n), with the technical subproof of the 
implication (Vne.V)[A(n)=> A(n+ 1)]. Without an announcement of 
the fact that induction is being used and an observation or proof that 
A(1) is valid, any amount of technical cleverness at proving A(n)=> 
A(n+ 1) is worthless [as a proof of the result (Vne.) A(n)]. 


So much for induction. One last remark concerning proofs of 
statements of the form 


Vx A(x) 
If all else fails, there is the method of contradiction. By assuming 
—Vx A(x) we get an x such that — A(x) (because — Vx A(x) is 


equivalent to 3x — A(x)). Now we have a place to start. The difficulty 
is finding the finish (i.e. the contradiction). 


Exercise 1.6 


(1) The symbol 


is a common abbreviation for the sum 


a,+da,+a3,+...+4, 


For instance, 


denotes the sum 
14+274+374...4n? 


Prove the following by induction: 
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(a) Vnew: 3 rasnin +1) 


r=1 


(b) Vnew: 3 r? = intn + 1)(2n + 1) 


r=1 


(2) Prove by the method of contradiction that if x and y are real 
numbers such that x + y is irrational, then at least one of x, y is 
irrational. Does it follow that if x + y is rational, then at least one 
of x, y is rational? 

(3) Prove that ,/ 3 is irrational. 

(4) Prove that there exist real numbers x and y such that x + y=y. 

(5) Prove that there are infinitely many natural numbers n such that 
yf nis irrational. How would you express this fact symbolically (no 
words allowed)? 


1.5 Mathematical truth 


The notion of ‘truth’ is one which has occupied philosophers for 
generations, and no doubt will continue to do so, and it is not our 
present intention to enter into this rather bewildering area. However, 
we should say something about what a mathematician means when 
he says some mathematical statement is true (or false). This requires a 
little knowledge of the development of mathematics. 

All mathematical concepts have their origin in the everyday world 
around us. For example, the natural numbers arise from our desire (or 
need) to count collections of objects, the rational numbers are 
required in order to measure lengths, geometry and trigonometry 
assist us in navigation, and so on. At the initial stage ‘truth’ means 
‘experimentally verifiable’. Thus we can demonstrate the ‘truth’ of the 
assertion 


14+2=3 


by taking one apple and two oranges and noting that there are three 
items altogether. But what about the assertion 


1000000000 + 5 = 1000000 005? 


Most of us would (I hope) agree that this statement is just as ‘true’ as 
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the previous one, though it is clearly out of the question to verify this 
fact by counting. (If you disagree with this last claim, just add a few 
more significant zeros.) So why do we so readily agree that the 
equation is valid? Well, although the concepts of natural number and 
addition arise out of everyday experience, they are really just idealized 
concepts — figments of our imagination — whose properties and be- 
haviour are postulated by us to correspond to natural phenomena. 
(None of us has ever seen ‘the number 10°. We may have seen ten 
apples in a bowl. And we have just seen two symbols on a page which 
we read as ‘ten’. But ‘the number 10’ itself remains locked away 
somewhere in our psyche.) The equation 


1000 000 000 + 5 = 1000000 005 


is ‘true’ because it is a consequence of the properties which we 
postulate for the natural numbers. Of course, the natural numbers 
and addition are so very basic and familiar to us, that it is rare to 
regard matters in quite this fashion (‘rare’, notice, not ‘incorrect’). But 
what about complex numbers? The equation 


(1 +i)? =2i 


is ‘true’, but we cannot test this by experiment — even a hypothetical 
experiment! In fact it is ‘true’ only in the sense that it can be proved to 
be a consequence of the properties which we postulate the complex 
numbers to have. 

This brings us to the real nature of pure mathematics. In each area, 
be it the theory of addition of integers, the theory of real numbers, 
geometry, calculus, or whatever, we begin with a collection of 
postulates (or axioms), setting down the properties the objects under 
consideration are to have. These postulates are formulated either by 
observation of some everyday event (as with the postulates for 
addition of integers or for classical Euclidean geometry), or else by 
consideration of what properties the system ‘ought’ to have (as with 
the postulates for real numbers — see Chapter 3 - or for complex 
numbers). From then on ‘truth’ means simply ‘provable from the 
postulates’. (There are also axioms for logical deduction — so the 
notion of just what constitutes a ‘proof’ is also determined by means 
of postulates, but at this stage you are probably best advised to stick 
to whatever intuitive notions you have already regarding the nature 
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of proof.) In the case of the natural numbers or the rational numbers, 
the basic postulates are often never cited at the first year under- 
graduate level, so you may have some difficulty in appreciating the 
above remarks. But it is extremely likely that your Analysis course 
will commence with some discussion of the postulates for the real 
numbers, and then you will see how ‘true’ corresponds to ‘provable 
from the axioms’. (We discuss the real numbers briefly in Chapter 3.) 

Our only concern now is that when you are next asked to prove 
something, you will be puzzled as to where you should start. 
Especially as you don’t know ‘all the axioms’. Don’t worry! (In fact, 
you probably will, but what more can I say?) Except in unusual 
circumstances, a ‘proof’ of something is simply a logical argument 
which convinces us that the fact is true (for the system concerned). The 
‘proof’ given earlier that ,/ 2 is irrational is valid in this sense. True 
enough, this ‘proof’ used various properties of the natural numbers 
(such as the square of an odd number being odd), which are only ‘true’ 
because they follow from the postulates for the integers. But we all 
‘know’ that (odd)? = odd, so in this case there is no need to go right 
back to the axioms to check it. As a general rule: let common sense be 
the judge of what is or is nota proof. After a little while you should have 
no difficulties in deciding what you may or may not assume in any 
given instance. (Though even the experts can disagree on this matter — 
and frequently do! A favourite pastime amongst mathematicians— 
not unlike cock-fighting—is to engage an analyst and a differential 
equations expert in a ‘discussion’ as to where the former should begin 
his lecture course: by introducing the postulates for the real number 
system and deducing everything from them, or by assuming the 
students ‘know’ enough about the real numbers already and building 
on this knowledge.) 


CHAPTER 2 


Sets and functions 


2.1 Sets 


The concept of a ‘set’ is extremely basic and pervades the whole of 
present day mathematical thought. Any well-defined collection of 
objects is a set. For instance we have: 


the set of all students in your class 
the set of all prime numbers 
the set whose members are you and my left foot 


So long as we have some way of specifying the collection, then we say 
it is a set. Our last example above has already made use of the notion 
of‘membership’. If A is a set, then the members of the collection A are 
called either the members of A or the elements of A. (With a concept as 
simple as a set, there is no way to avoid circular definitions: but we all 
know what is meant.) We write 


xeA 


to denote that x is an element of A. 
Some sets occur frequently in mathematics, and it is convenient to 
adopt a standard notation for them: 


AN: the set of all natural numbers (i.e. the numbers 1, 2, 3, etc.) 
#: the set of all integers (ic. the positive and negative whole 
numbers) 
2: the set of all rational numbers (i.e. ‘fractions’) 
&: the set of all real numbers 
@: the set of all complex numbers 


Thus, 
xEeR 
32 


SETS AND FUNCTIONS 33 
means that x is a real number. And 
(xeE2) A (x > 0) 


means that x is a positive rational number. 

There are several ways of specifying a set. Ifit has a small number of 
elements we can list them. In this case we denote the set by enclosing 
the list of the elements in curly brackets; thus, for example, 


{1, 2, 3, 4, 5} 


denotes the set consisting of the natural numbers 1, 2, 3, 4 and 5. 
By use of ‘dots’ we can extend this notation to any finite set; e.g. 


{1, 2, 3,...,n} 
denotes the set of the first n natural numbers. Again 
{2, 3, 5, 7, 11, 13, 17,..., 53} 
denotes the set of all primes up to 53. 
Certain infinite sets can also be described by the use of ‘dots’ (only 
now the dots have no ‘end’); e.g. 
{2, 4, 6, 8,..., 2n,...} 
denotes the set of all even natural numbers. Again, 
{..., -8, —6, —4, —2,0, 2,4, 6, 8,...} 


denotes the set of all even integers. 

In general, however, infinite sets (and many finite ones too) are best 
described by giving the property which defines the set. If A(x) is some 
property, the set of all those x which satisfy A(x) is denoted by 


{x|A(x)} 


Or, if we wish to restrict the x to those which are members of a certain 
set X, we would write 


{(xeX|A(x)} 
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This is read ‘the set of all x in X such that A(x/y. 
For example: 


N ={xeF|x > 0] 
2={xEk|(AmneF)((m > 0) A (mx = n))} 

{/2, —./2} = {xeR|x? = 2} 

{1, 2,3} = {xeW |x < 4] 


Two sets, A, B are equal, written A = B, if they have exactly the same 
elements. As the above example shows, equality of sets does not mean 
that they have identical definitions; there are often many ways of 
describing the same set. The definition of equality reflects rather the 
fact that a set is just a collection of objects. 

If we have to prove that the sets A and B are equal, it is often quite 
difficult to prove in one go that they have the same elements. What is 
usually done is to split the proof into two parts: 


(a) Show that every member of A is a member of B. 
(b) Show that every member of B is a member of A. 


Taken together, (a) and (b) clearly imply A = B. (The proof of (a), (b) is 
usually of the ‘take an arbitrary element’ variety. To prove (a), for 
instance, we must prove (VxeA)(xeEB); so we take an arbitrary 
element x of A and show that x must be an element of B.) 


Exercise 2.1 


(1) Let 
P={neN |(n> 1) a (Vx,yeV)\(xy =n=>(x=1 Vv y=1))} 


What well-known set is P? 
(2) Let 
P= {xeA|sinx =0}, O = {nn|neX} 
What is the relationship between P and Q? 
(3) Let 
A= {xeA@|(x > 0) A (x? =3)} 


Give a simpler definition of A. 
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(4) Let 
A= {o,t, fi s,e, n} 


Give an alternative definition of the set A. (Hint: this is connected 
with 4”, but is not entirely mathematical.) 


The set notations introduced have obvious extensions. For in- 
stance we can write 


2={m/n|m, ne¥,n +0} 


and so on. 


It is convenient in mathematics to introduce a set which has no 
elements: the empty set (or null set). There will only be one such set, of 
course, since any two such will have exactly the same elements (!) and 
thus be (by definition) equal. The empty set is denoted by the 
Scandinavian letter 


O 
The empty set can be ‘defined’ in many ways; e.g. 
@ ={xeR|x? <0} 


O={xeV|1 <x <2} 
D = {xx # x} 


Notice that @ and {@} are quite different sets. @ is the empty set: it 
has NO members. {@} is a set which has ONE member. Hence 


DFO} 
Indeed, the correct relation here is 


DED} 


(The fact that the element of {@} is the empty set is irrelevant in this 
connection: {@} does have an element, @ does not.) 


A set A is called a subset of a set B if every element of A is a member 
of B. For example, {1, 2} is a subset of {1, 2, 3}. We write 
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ACB 


to mean that A is a subset of B. If we wish to emphasize that A and B 
are unequal here, we say that A is a proper subset of B and write 


AcB 


(This usage compares with the ordering relations < and < on &.) 
Clearly, for any sets A, B, we have 


A=B iff (AS B)a (BSA) 
Notice also that for any set X, 
Ox 


Exercise 2.2 


(1) List all subsets of the set {1, 2, 3, 4}. 

(2) List all subsets of the set {1, 2, 3, {1, 2}}. 

(3) Prove (by induction) that a set with exactly n elements has 2” 
subsets. (The set ofall subsets of a set X is called the power set of X, 
denoted by A(X).) 


2.2 Operations on sets 


There are various natural operations we can perform on sets. (They 
correspond roughly to addition, multiplication, and substraction for 
integers.) 

Given two sets A, B we can form the set of all objects which are 
members of either one of A and B. This set is called the union of A and 
B and is denoted by 


AUB 
Formally, this set has the definition 
AUB = {x|(xeEA) v (xe B)} 


(Now the reader gets his first real inkling as to why we defined our 
precise use of ‘or’ to mean the ‘inclusive-or’.) 
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This can be illustrated diagrammatically as follows. We first of all 
introduce the notion of a universal set. It is rare in mathematics to 
discuss ‘arbitrary sets’. One is generally only interested in sets of reals, 
or sets of integers, or so on. Now, when discussing sets of reals, it is 
convenient to imagine that there is nothing else in the world except 
reals. For one thing, this prevents our having to keep saying ‘x is a 
real’. When we do this, we announce the fact by the rather pompous 
phrase, ‘Let Z be the universal set for the discussion’. This means that 
any set mentioned in the ensuing discussion will be assumed to be a 
subset of @. Likewise if we fix # as universal set, or .”, etc. Thus, 
there is not one universal set: rather we introduce one such whenever 
it is convenient to ignore all the possibilities other than the ones 
concerned. (If this is still confusing to you, it should become clearer 
when we look at set-theoretic complements in a short while.) And now 
to our diagram. Fix some universal set U. We represent U by means of 
a rectangle on the page. The elements of U are the ‘points’ within the 
rectangle. So, relative to our fixed universal set, all ‘objects’ lie inside 
our rectangle. Sets (i.e. subsets of U) are denoted by encircling regions 
within U. 

In Fig. 2.1, the set A is represented by the interior of the left-hand 
circle, the region shaded horizontally, and the set B is represented by 
the interior of the right-hand circle, the region shaded vertically. The 


U 


Figure 2.1 Union and intersection 
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set AUB is represented now by the total shaded region (which 
includes a region which is shaded two ways here). Such a diagram- 
matic representation of sets is called a Venn diagram. 


Exercise 2.3 


(1) What is the relationship between A and B illustrated by the Venn 
diagram in Fig. 2.2? 
(2) Draw a Venn diagram to illustrate the fact 
AUBSC 
(3) Draw a Venn diagram to illustrate the fact 


AGBUC 


The intersection of the sets A, B is the set of all members which A and B 
have in common. It is denoted by 


ANB 
and has the formal definition 


AB = {x|(xEA) A (xeEB)} 


Figure 2.2 Inclusion 
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In Fig. 2.1, the set A B is represented by the region which is doubly 
shaded. 

Two sets A, B are said to be disjoint if they have no elements in 
common: that is, if ANB=@. 


Exercise 2.4 
(1) For each of the following pairs of sets A, B, find AUB and ANB: 


(a) (ab, c},ta def} 
(b) {xEB&|x <0}, {xe#|x 20} 
(c) {1, 2, U1, 2, 3}}, U1, (1, 233 


(2) Prove the following (for any sets A, B): 
ANBGCACAUB 


(Drawing a Venn diagram does not constitute a proof. It 
illustrates the fact, but does not prove it. For a proof, a logical 
argument is needed.) 

(3) Prove the following statements: 


ANGD=B 

AUQG=A 
ANB=BonA 
AUB=BUA 


ANA=AVUA=A 


(4) Prove that if A, B, C are sets, then 


(a) AA(BAC)=(AOB)AC 
(b) AU(BUC)=(AUB)UC 


Illustrate these results by means of a Venn diagram. 
(5) By considering the sets A = &, B = 2, C = set of irrationals, show 
that it is not always the case that 


(AUB)NC =AVU(BNC) 


(6) Show that the equality (AN B)UC = AnN(BUC) is not always 
valid. 
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Figure 2.3 Complement 


Once we have fixed a universal set we can introduce the notion of 
the complement of the set A. Relative to the universal set U, the 
complement of a set A is the set of all elements of U which are not in A. 
This set is denoted by A’, and has the formal definition 


A’ = {xeU|x€éA} 


(Notice that we write x¢A instead of —(xeA), for clarity.) 
In Fig. 2.3, A’ is represented by the unshaded region. 


The following theorem sums up the basic facts about the three set 
operations just discussed. 


Theorem 2.1 


Let A, B, C be subsets of a universal set U. 


(1) AU(BUC) =(AUB)UC 
(associative property of union) 
(2) AN(BNC)=(ANB)NC 
(associative property of intersection) 
(3) AUB=BUA 
(commutative property of union) 
(4) ANB=BoA 
(commutative property of intersection) 
(5) AU(BNC) =(AUB)n(AUC) 
(distributive property for U over 4) 
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(6) AN(BUC) =(ANB)U(ANC) 
(distributive property for ™ over UV) 
(7) (AUB) =A'OB’ 
(8) (ANB) =A'UB’ 
((7) and (8) are called the de Morgan laws) 


(9) AUA'=U 
(10) ANA'=@ 
(11) (AY =A 


(idempotence of complement) 


Proof 
The proof of parts (1) to (4) and of parts (9) to (11) are left as an exercise 
(some have already been given in Exercise 2.4). In each of the 
remaining cases we take the statements in pairs and produce a Venn 
diagram illustration for the one and a rigorous proof for the other. 
Consider part (5). We can represent this diagrammatically as in 
Fig. 2.4. We have drawn the diagram to illustrate the most general (i.e. 
the most complex case), which is when all the sets have elements in 
common with each of the other sets. We have also numbered each of 
the regions. Now let us see which regions correspond to which sets. 


Figure 2.4 Diagrammatic representation of the distributive law 
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Well, 


A is the regions 1, 2, 3, 4 


and 
BoC is the regions 3, 6 (common to B and C) 
So, 
AU(BOC) is the regions 1, 2, 3, 4, 3, 6, ie. 1, 2, 3, 4, 6 

Again, 

AUB is the regions 1, 2, 3, 4, 6,7 
and 

AUC is the regions 1, 2, 3, 4, 5, 6 
so 


(AUB)n(AUC) is 1, 2, 3, 4, 6 (common to both) 


But this is the same region that we obtained for AU(BOC) above. 
This illustrates (5). We leave it to the reader to provide a formal proof 
(which he can model on our proof of (6) below). 

Consider part (6). Here is a rigorous proof. Let 


D=An(BUC) 
E=(ANB)U(ANC) 


We prove first that DC E. Let xeD. Then xe A and xe BUC. Since 
xeEBucC, either xeB or xEC, or both. In case xe B, we have xe A and 
x€B,so xe AB. On the other hand, if x¢B, then we must have xEC, 
so xe€A and xeEC, giving xe AMC. In either of these cases, xe E. Hence 
DCE. Now we prove E ¢ D. Let xeE. There are two cases. Suppose 
first that xe A B. Then xeA and xeB,soxeA and xeBUC,soxeD. 
On the other hand, ifx¢A™ B, then xe A NC, so again we obtain xe A 
and xeBUC, giving xeD. Hence ECD. 
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Figure 2.5 Diagrammatic representation of the de Morgan laws 


Since D¢ E and ECD, it follows that D = E, which proves (6). 

We turn now to (7) and (8). We illustrate (7) by means of a diagram 
(Fig. 2.5), and leave it to the reader to provide a rigorous proof. 

Clearly, A UB is regions 1, 2, 3. Thus, (A U B)’ is region 4. (the only 
remaining region). 

Again, A’ is regions 3, 4, and B’ is regions 1, 4,so A’ B’ is region 4 
(the common region); this illustrates (7). 

Now we prove (8). Let xe(AB)’. Thus x€ AN B. Hence 

—((xEA) A (xeEB)) 
This is the same as 
(—(xeEA) v —(xeEB)) 


In other words 


(x€A) v (x¢B) 


(xe A’) v (xe B’) 


xe A’ UB’ 
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This shows that (A7 B)' € A' UB’. The above steps may be reversed 
to give A‘UB’ <(ANB)’, thereby completing the proof. 


Exercise 2.5 


(1) Illustrate the proofs of parts (6) and (8) of Theorem 2.1 by means 
of a Venn diagram. 
(2) By using the set identities established in the theorem, show that 


[AU(BAC)]AC =(AUB)AC 


(What is required here is an algebraic proof, not a set-theoretic 
one.) 
(3) Use the distributive laws to show that 


(AUB)A(CUD) =(ANC)U(AND)U(BNC)U(BOD) 


Does the result remain valid if we interchange nN and v here? If so, 
justify this by an argument which does not involve a direct 
calculation involving the sets concerned. If not, give an example 
to justify your answer. 


2.3 Functions 


The concept of a function from real numbers to real numbers will be 
familiar to you already. For instance the equation 


y=x?+x41 


determines a function from real numbers to real numbers. We can 
draw its graph as in Fig. 2.6. 

We are able to plot the graph of this function because, for each 
value of x in # we can calculate the corresponding value of y: we just 
substitute the value of x concerned into the equation. So what the 
above equation does is provide us with a general rule for calculating 
values. (After all, x and y are just variables, not specific numbers.) 

We can thus generalize the concept of a function as follows. Let A 
and B be any non-empty sets. A function from A to B is a rule which 
associates with each member of A a unique member of B. We make no 
restrictions on the rule: the only crucial point is that the element of B 
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Figure 2.6 Graph of the function y=x?+x+1 


which we associate with a given member of A must be unique. Now, in 
the case of a function as above given by an equation, we can refer to 
that function by giving its equation. But our definition allows for all 
kinds of ‘rules’. Hence it is convenient to introduce some notation. 

Suppose we have a rule which associates with each member of the 
non-empty set A a unique member of the set B. Let f denote this rule. 
Then we say f isa function A to B. We write 


f:A-B 


to abbreviate this last sentence. Ifae A, we denote the unique element 
of B which the rule associates to a by 


f(a) 
Thus, in our original example above, f :2- and 
f(x)=x?+x41 


(This last equation reads: the value of the function f at x is the real 
number x? + x + 1.) 

Notice that we have not defined the concept of ‘a function’, but 
rather ‘a function from a set A to a set B’. This distinction should be 
borne in mind at all times, as it affects our subsequent definitions to a 
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Inputs Black box Possible outputs 


Figure 2.7 The three parts of a function 


very great degree. One can think ofa function as a sort of ‘black box’ 
which has an input hole and an output hole (Fig. 2.7). At the input 
hole there is another box full of possible inputs, and at the output hole 
there is a further box designed to take all the ‘possible’ outputs. 
We may feed into the black box any of the members of A. The black 
box f then processes our input and produces a single output, which 
will always be a member of the set B. 

Now, it may be that if two different inputs a,,a, are made, the same 
output occurs in each case, i.e. f (a,) = f (a,). If this never happens, we 
say f is one-one, or injective, or an injection. Thus, f is one-one if 
distinct inputs produce distinct outputs. 

Now let’s look at the set B of ‘possible’ outputs. We have enclosed 
the word ‘possible’ in quotation marks for the following reason. The 
output box is an integral part of our black box apparatus (it is not 
possible to purchase a black box on its own). But it is possible that 
some of the space in B is not needed: the black box will never be able 
to fill the box B. But if it is possible to fill box B, we say f is onto or 
surjective, or a surjection. 

Let us now see what these definitions mean in relation to our 
original example of the rule determined by the equation 


y=x?4+x4+1 


This determines a function f:2-& given by f(x)=x?4+x4+1. 
Is f one-one? That is, can we find distinct reals x, and x, such that 
f (x,) = f(x.) (in which case it will not be one-one), or is it the 
case that 


(Vx4, X2EA)(Xy #X2—> f (1) # f(X2)) 


(in which case it will be one-one). A quick glance at Fig. 2.6 shows us 
that 


J(-Y=f OH! 
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Hence f :2-—2 is not one-one. Is f onto? That is, does every 
element of Z occur as a value of f for some input? Again, the diagram 
(Fig. 2.6) provides the answer. The number 0 is never a value of f. 
(Indeed, no number less than 0.75 is a value of f) Hence f : 2 Z is 
not onto. 

But the same equation y = x* + x + 1 also determines a function 
g:R2* >&, where R* = {xE#|x 20}, the set of all non-negative 
reals. The only difference between this function and the previous one 
is that our new function has a different ‘input box’; we are no longer 
allowed to feed negative numbers into our ‘black box’. The graph of 
this function is illustrated in Fig. 2.8. 

The function g: 2* +2 so defined is clearly one-one: it is indeed 
increasing: ifx, < x,, then g(x,) < g(x,). But it is still not onto, for 0 is 
still not a value, and neither is any real less than 1. 

Finally, let h : 2* > A be the function h(x) = x? + x + 1, where now 
A= {xe@&|x 2 1}. This function is both one-one and onto. 

A function which is one-one and onto is sometimes called a 
bijection. 

Let us now briefly review our definitions. A function from a 
non-empty set A to a set Bisa rule which associates with each element 
a of A a unique element f(a) of B, called the value of f at a. We write 
f :A—B in this case. The set A is the domain of f, the set B the 
codomain. When a function is specified, it is not sufficient to give 


y 


-1 


Figure 2.8 Graph of the function y= x?+x+1 for x20 
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the rule: the domain and codomain must also be given. The function 
f :A—B is one-one (injective) if distinct elements of A give rise to 
distinct values in B. And f is onto (surjective) if every element of B 
is a value of f: A function which is both one-one and onto is called a 
bijection. 

Functions are also called mappings, maps or transformations 
(though the last word tends to be reserved for special kinds of 
functions). 

Now let us consider some examples of functions, other than the 
familiar example of function given by an equation. 


(1) Consider the function f : 2-2 defined by 


x, ifx20 
foy=}_* ifx <0 
This function is not defined by an equation: rather the rule consists of 
two separate cases. In words, the rule is: given x as input, the output is 
x if x 20 and is — x if x <0. Since for each x there is exactly one 
possible f (x) (because no x is at the same time 2 0 and < 0), this does 
define a function. It is the absolute value function. We usually write 
|x| instead of f(x), and call |x| the absolute value of x, or the modulus 
(or mod) of x. This function is not one-one, since, for example, 
f(—1=f(1) =1. It is not onto, since no negative real is a value. 
(If we were to consider instead the function g:#->* defined by 
g(x) = |x|, then g would be onto, but still not one-one). 
(2) Consider next the function f:2— WV defined by 


1, if x is irrational 


joel 


0, if x is rational 


The function f :2— NW so defined is neither one-one nor onto. 
(3) Let A = {a, b, c,d}, B= {c, d, e}. Define f : AB by 


f@=c, flb)=d, fid=c, f(d)=e 


(In other words, the ‘rule’ is determined by listing all the values 
individually.) The function f so defined is not one-one, but it is onto. 

(4) Let A be the set of all countries, and let B be the set of all capital 
cities in the world. Define f : A — B by letting f(a) be the capital city 
of a. The function f is a bijection. 
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(5) Let f:2->F be defined by 


f (x) =the largest integer n such that n <x 


We call f(x) the integer part of x. This function is not one-one, but it is 
onto. 
(6) Let A be any non-empty set. Define 14: A— A by 


I,(a)=a_ (VaeA) 


We call I, the identity function on A. It is clearly bijective. 


Exercise 2.6 


(1) Let A = {1, 2}, B= (1, 2, 3}. List all the functions from A to B. 
(2) List those functions in question (1) which are 
(a) one-one (b) onto (c) bijective. 
(3) Define f : 2 by f(x) =4(x + |x|). Evaluate f(0), f(1), f (2). Is 
f one-one? Is f onto? Justify your answers. 
(4) Define f : 2-2 by 


x? +1, ifx20 
x—-lifx<0 


joel 


Prove that f is one-one. Is f onto? 
(5) Define f:“—> WV by 


2n, if n is even 
n, if n is odd 


finy=4 


Show that f is one-one. Is f onto? 

(6) Let A = {1, 3, 5, 7,...}, the set of odd natural numbers, and B 
= {2, 4, 6, 8,...}, the set of even natural numbers. Give examples 
of functions from A to B which are: 


(a) One-one but not onto. 

(b) Onto but not one-one. 

(c) Neither one-one nor onto. 
(d) One-one and onto. 


Let f :A—B.IfX ¢ A, the range of f on X is the set of all values of 
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f for inputs from the set X. In symbols 
(f)|xeX} 
or, alternatively, 
{beB\(dxeX)(f (x) = b)} 
The range of f on X is denoted by 
SLX] 


We refer to the set f [A] simply as the range of f. 


Exercise 2.7 


(1) Let f :A—B. Show that f is onto iff f[A]=B. 
(2) Define f:Y%—>W by 


2, if n is not prime 
f(n -| Poona. 
n, if n is prime 


Let P be the set of all primes, K the set of all non-primes (in /). 
Identify the sets f[V], f[P], f LK]. 

(3) Let f :A—B. Show that f is not one-one iff there are elements 
x, yEA such that x #y and f[{x, y}] has only one element. 

(4) Define f: 2-2 by f(x)=x?+x+1. What is f[#]? What is 
ftary 

(5) Let f:4%—MN. Show that there is a set ASW such that 
h:A — WM is one-one, where we define 


h(n)=f(n) — (WneA) 


Now show that there is a set B&W such that g:A—B is a 
bijection, where we define 


g(n)=h(n) — (Wne A) 


We say two functions f, g from a set A to a set B are equal iff for 
every aeA, f (a) = g(a). Thus, equality of functions depends upon the 
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domains being equal, the codomains being equal, and the actions 
being equal. We do not require that the ‘rules’ are the same. For 
instance, f:2-& and g:#—-&Z are equal, where we define 


J (x) =maximum (x, — x) 
g(x) = |x| 


2.4 Composition of functions. Inverse functions 


Suppose f:A—B and g:B-—C are given. Then we can define a 
function h: A>C by the following rule: given aéA, use rule f to 
calculate b = f (a), and then use rule g to calculate g(b) for this b; the 
resulting element of C (being uniquely defined) shall be h(a). In 
symbols, 


h(a)=g(fla)) (Wa A) 


We denote the function h so defined by go ff and refer to this function 
as the composition of g and f: Notice that for g° f to be defined, the 
codomain of f should be equal to the domain of g. 


Exercise 2.8 
(1) Let f:A->B, g:B—>C,h:C—D. Prove that 
ho(gof)=(hog)of 


(associative law) 
(2) Define f :2->2 by 


x’, ifx20 
ee 
Define g: 2-2 by 


Qe x+1, ifx21 
PU Vis ap et 


Find formulae for the functions go f and f og. Use this example 
to show that g° f = f og is not in general true. 
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(3) Define f: 2-8 and g:2-Z by 


fQ)=x? 


ne x+1, ifx>0 
Oe a0). Gt ee 0 


Find g° f and f og. In each case say whether the function is one- 
one, onto, both, or neither, and identify the ranges of each of the 
functions f, g,g°f, fg. 


(4) Let A = { {1, 2}, {1, 3}, {2}, {2, 5}, {3, 4, 5}}. Define f: 43> 
by 
f (a) =sum of the elements of a (VaeA) 


Find f[A]. Now define g: V—> / by 


the largest prime < n, if n> 1, 
gin)=foe P 
lifn=1 


Find gf and identify its range. Is f one-one? Is ge f one-one? 


Let f:A—-B. We say f is invertible if there exists a function 
g:B—A such that for all ae A, beB, 


f(a) = b<g(b) =a 


Theorem 2.2 


Let f :A—B. Then f is invertible iff it is a bijection, in which case the 
function g as described above is unique. 


Proof 

Suppose f is invertible. We prove that f is a bijection. By the 
invertibility of f there is ag: B— A as above. Let a,,a,€A, a, #4). 
We prove that f(a,)# f(a,), which shows that f is one-one 
(since a,,a, are arbitrary). Suppose, on the contrary, that 


f(a,) = fla,)=b 


By the property of g we have g(b)=a, and g(b)=a,. But g is a 
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function. Hence a, =a,, a contradiction. Thus f(a,)# f(a,), and f is 
shown to be one-one. Let be B now. We show that f(a) = b for some 
ae A, which shows that f is onto (since b is arbitrary). Let a = g(b). By 
the properties of g, f(a) = b, so we are done. Hence f is a bijection. 

Conversely, assume now that f is a bijection. We show that f is 
invertible. 

Let beB. Since f is onto there is aéA such that f(a) =b. But f is 
one-one, so there cannot be another such a. Hence, for each b in B 
there is exactly onea in A with f(a) = b. Define g : B— A by the rule: 
for each beB, g(b) is the unique element a of A for which f(a) =b. 
Clearly, g is as required for invertibility of f Since this is the only 
possible definition of g, we see also that g is unique. QED. 


The unique function g as above is called the inverse of f, and 
denoted by f~}. 


Exercise 2.9 


(1) Let f :A—B be invertible. Show that 


folofsl, fof ~t=Is 


(2) Define f:2- by 
x? +1, ifx20 
soya Pe ifx<0 


Show that f is a bijection and find f~'. 

(3) Define f : 2? > R? by f(x, y)=(x + 2y, x — y). Show that f is a 
bijection and find f~'. 

(4) Let f:A—-B and g:B-C be invertible. Show that gof is 
invertible and that (go f)>'=f7'og™. 

(5) Let A be the set of all countries, B the set of all capital cities of the 
world. Define f :A—B by 


{@ =the capital of the country a (VaeA) 


Describe the inverse function f~ *. 


CHAPTER 3 


The real numbers 


3.1 The real line 


In this book we assume that you already have a reasonable idea of the 
concept of a real number. But why do we need to introduce the real 
number system in the first place? In real life, we are never able to 
measure any quantity (e.g. length or temperature) to more than a few 
decimal places, for which the rational numbers suffice. Indeed, the 
rational numbers suffice for the measurement of any quantity to 
whatever degree of accuracy is required. This fact is illustrated by the 
following simple result: 


Theorem 3.1 
If r, se2, r<s, there is te2 such that r<t<s. 
Proof 


Let t=43(r +5). Clearly, r<t<s. But is te2? Well, letting r = m/n, 
s = p/q, where m, n, p, qag&. We have 


1 + 
ee (m42\=™4 np 


2nq 


so as mq + np, 2nqe®, we conclude that te2. QED. 


The trouble is, although the rationals are ‘dense’ in the sense of the 
above theorem, there are nevertheless ‘holes’ in the rational line. For 
example, if we let 


A= {xE9|x? < 2} 
B= {xeQ|x? >2} 
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then every element of A is smaller than every element of B, and 
AUB=2 


But A has no greatest member and B has no smallest member (as the 
reader can easily check for himself), so there is a sort of ‘hole’ between 
A and B. This is the ‘hole’ where ,/ 2 ought to be, of course. The fact 
that 2 contains ‘holes’ makes it unsuitable for mathematical pur- 
poses, even though it suffices for all our measurements. Indeed, a 
number system in which the equation 


x? -2=0 


has no solution is a pretty weak one. So we agree that for 
mathematics, we require a number system which does not contain 
‘holes’, and in which we can take square roots, etc. (at least for positive 
numbers). But what are the real numbers? If we ask what the rationals 
are, we can answer by referring back to the integers, since all rationals 
are essentially quotients of integers. And the integers are so basic that 
for most purposes this suffices as an answer. But when the same 
question is asked of the reals the answer is not so easy. The point is, 
although the rationals are an abstract system of entities, and thus 
nothing more than figments of our intuition, they correspond to such 
basic concepts that they possess a certain concreteness. (We can 
calculate a number to 30 decimal places, even 50, and to get all the 
rationals we just need to imagine that we could calculate to any finite 
number of places, which does not require too great a feat of 
imagination.) But the concept of an irrational number is already 
much more abstract. The result (proved in Chapter 1) that it is utterly 
impossible to calculate the decimal expansion of ,/2, that its 
expansion continues for ever, without recurrence, defies the ‘finite’ 
imagination. We cannot ‘construct’ such a number, not even in 
theory. So in what sense are we justified in regarding, say, a: 2asa 
number at all? The answer is, only in a strict mathematical sense. One 
constructs the real number line by an abstract mathematical con- 
struction which, essentially, ‘fills in all the holes’ in 2. This con- 
struction is quite complicated, and was one of the major mathemati- 
cal accomplishments of the 19th century. It only answers the question 
‘What is a real number?’ in an abstract, mathematical sense. Its value, 
really, is that it does tell us that we are not going to run into any 
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trouble if we imagine that the ‘holes’ in 2 are all filled in. And the 
advantage of so doing is that the mathematical possibilities multiply 
enormously. (For instance, the calculus cannot be developed over 2, 
but it can over Z&.) 

For our present purposes, we shall take as a sort of ‘definition’ of a 
real number the points on the real line. (This is, of course, a circular 
‘definition’, and hence no definition at all, but it provides us with a 
useful and intuitive starting point. If you object violently, you can try 
to imagine starting with the rational line and ‘filling in’ the ‘holes’, but 
even trying to find a ‘hole’ is not easy!) The set of reals (i.e. the points 
on the real line) is denoted by &. The elements of Z possess a natural 
ordering ( < ), which corresponds to the ‘ordering’ of the points on the 
real line. 

The fact that the real line has no ‘holes’ is expressed by the so-called 
completeness property, described below. 


3.2 Upper bounds. Completeness 
Let ACR. Any real x such that 


(VaeA) (ax) 


is called an upper bound of A. We call x a least upper bound (l.u.b.)if 
there is no smaller upper bound: that is if, whenever y < x, y fails to be 
an upper bound of A. Clearly then, x is a least upper bound of A if 
(VaeA) (a <x), and whenever y < x there is anaeA such thata> y. A 
set A of real numbers does not have to have any upper bound, of 
course. For instance, the set ”< @& has no upper bound. But if A 
does have an upper bound, it will necessarily have a least one. This is 
the completeness property of the real line. 


Completeness property 
Let A<&. If A has an upper bound, then it has a least upper bound 
(in &). 


Clearly, if A has a least upper bound, this must be unique. (If x and y 
were both least upper bounds, one would have to be smaller than 
the other, say x < y, so y would not be a least upper bound.) 

The rational line, 2, does not have this property. For instance, the 
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set 
A= {reQ|r? <2} 
is bounded above in 2 by 2, but it has no least upper bound in 2. To 
see this, let xe2 be any upper bound of A. We show that there is a 
smaller one (in 2). Let x = p/q, p, qé-V. Suppose first that x? <2. 


Thus 2q? > p?. Now, as n gets large, the expression n?/2n + 1 increases 
without bound, so we can pick né./ so large that 


n2 p? 
a Oe 
2n+1° 2q?—p? 
Rearranging, this gives 


2n?q? > (n+ 1)? p? 


Hence 
22 
(+) Fe? 
n q 
Let 
n+1 
yee 
nq 


Then, since (n + 1)/n> 1, we have x < y. But y is rational and we have 
just seen that y” <2, so yeA. This contradicts the fact that x is an 
upper bound for A. It follows that we must have x? > 2. Since there is 
no rational whose square is 2, this means that x? > 2. Thus p” > 2q, 
and we can pick ne.V so large now that 
2 

een: Se 
2n+1° p?—2q? 


2 


which becomes, upon rearranging, 


p?n? > 2q4?(n + 1) 


2 2 
ie. &(4) >2 
q’\n+1 
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n 


aes 


Qs 


Since n/(n + 1) < 1, y < x. We complete the proof by showing that y is 
an upper bound for A. Let ae A. Thus a? < 2. Thus 


22 
pan eee ae) 
- (4)5 y 


Thus a < y, as required. 
On the other hand, the set A (regarded as a subset of 2 now) does 
have a least upper bound in &, namely ,/2. 


Exercise 3.1 


(1) Let AS &, xe&. Show that x is the l.u.b. of A iff (a) and (b) below 
are valid: 


(a) (VaeA)(a< x) 
(b) (Ve > 0)(aae A)(a > x — 6) 


(2) By negating (a) and (b) above, obtain a symbolic definition of the 
property ‘x is not the l.u.b. of A’. (Put your answer in a positive 
form.) 

(3) Besides the completeness property, the Archimedean property is an 
important fundamental property of #. This says that if x, ye# 
and x, y>0, there is an ne.V such that nx > y. 

Use the Archimedean property to show that ifr, se#@ and r<s, 

there is a gq¢2 such that r<q<s. (Hint: pick nev, n> 1/(s—7), 

and find an me. such that r <(m/n) < s.) 


3.3 Absolute values 


Recall from Chapter 2 the definition of the absolute value function 


|= x, if x 20 
~ )—x, ifx <0 


We prove some simple results about this function. 
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lal =?) 
(By f x is always meant the positive square root.) 
Proof 


Clearly, |a|? =a”. Taking square roots, Jal?) =\/(a’). But |a| > 0. 
Hence / (al?) = |al, and the theorem is proved. 


Theorem 3.3 
|ab| = |a\|b| 


Proof 
There are four possible cases: 


(I) a20,b620. Then |ab| =ab = |al|b]. 

(II) a20,b<0. Then |ab| = — (ab) =a( — b) = |al|. 

(III) a<0,b20. Then |ab| = — (ab) =(—a)b = |al|b}. 

(IV) a<0,b<0. Then |ab| =ab =(—a)(—5)=|al|b|. QED. 


Theorem 3.4 
Let a> 0. Then |x| <a iff -a<x Sa 


Proof 
Exercise for the reader. 


Theorem 3.5 (Triangle inequality) 
la + b| < lal + [b| 


Proof 
Clearly, by Theorem 3.3, 


ab & |ab| = |al|b| 
Thus 


2ab & 2Ia\|b| 
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So 
a? + 2ab + b? <a? + 2Ia\|b| + b? 


i.e., by Theorem 3.2, 


(a +b)? < lal? + 2Ial|b| + |b/? 


(a +b)? < (lal + |b|)? 
Hence 
Jat bP </(\a| + |b1)? 
ie., by Theorem 3.2, 


la + b] <|al + || 
QED. 


If x, ye&, the positive number |x — y| represents the ‘distance’ 
between x and y on the real line. If x # y, then |x — y| > 0. Conversely, 
if |x — y|>0, then x # y. Hence x = y iff |x — y| =0. By the triangle 
inequality, if x,y,.ze#, we have 

Ix — yl <|x — 2] + |z— y| 


This inequality is also referred to as ‘the triangle inequality’. 


3.4 Intervals 
Certain types of subset of @ occur so frequently that it is convenient to 
introduce a special notation for them. Let a,be&, a < b. 
The open interval (a,b) is the set 
(a,b) = {xeRla<x <b} 


The closed interval [a,b] is the set 


[a,b] = {xeRla<x <b} 
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The point to notice here is that neither a nor b is an element of (a,b), 
but both a and b are elements of [a,b]. (This seemingly trivial 
distinction turns out to be highly significant in elementary Real 
Analysis.) Thus, (a, b) is the stretch of the real line beginning ‘just past’ 
a and ending ‘just before’ b, whilst [a, b] is the stretch beginning at a 
and ending at b. 

The above notation extends in an obvious manner. We set 


[a,b) = {xeRla<x <b} 
(a,b] = {xEBla<x <b} 


[a, b) is left-closed and right-open; (a, b] is left-open and right-closed. 
We refer to either [a,b) or (a,b] as a half-open (or half-closed) interval. 
Finally, we set 


(— ©, a) = {xER|x <a} 
(— 0, a] = {xER|x <a} 
(a, 00) = {xEB|x >a} 
[a, 00) = {xeB|x 2a} 


Notice however that the symbol oo is never coupled with a square 
bracket. This would be misleading, since oo is not a number, just a 
useful symbol. In the above definitions it simply helps us to extend a 
notation to cover another case. 

Any set of the above forms is called an interval. Thus, an interval is 
just an uninterrupted piece of the real line. 


Exercise 3.2 


(1) What is l.u.b. (a, b)? What is l-u.b. [a, b]? What is max (a, b)? What 
is max[a, b]? 

(2) Let A = {|x — y| |x, ye(a, b)}. Prove that A has an upper bound. 
What is l.u.b. 4? 

(3) Prove that the intersection of two intervals is again an interval. Is 
the same true for unions? 


(4) Taking & as the universal set, express the following in terms of 
unions of intervals: 


(a) [1, 3)’ 
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(b) (1, 7]' 

(c) (5, 8]' 

(d) (3, a 8] 

(e) (—%, 3)'U(6, 0) 

(f) A’, eae A=(—o, 5]U(7, 2) 
(g) A’, where A =(6, 8)N(7, 9] 

(h 

(i 


) 
) (1, 2n[2, 3) 
(1, 4] 0[4, 10] 


3.5 Sequences 


Suppose we associate with each natural number n a real number a,,. 
The set of all these numbers a,, arranged according to the index n, is 
called a sequence. We denote this sequence by 


{a,} 
Thus, the sumbol {a,} represents the sequence 
G5 055. 03,.6 eg es 


For example, the members of themselves constitute a sequence, 
when assigned their usual order 


1,225°3) dtp Ajtace 


This sequence would be denoted by {n} (because a, =n for each n). 
Or we could order the elements of / in a different manner to 
obtain the sequence 


2, 1, 4, 3, 6, 5, 8, 7,... 
This is quite a different sequence from the sequence {n}, since the 
ordering in which the members of the sequence appear is important. 
Or, if we allow repetitions we get a completely new sequence 


1, 1, 2, 2, 3, 3, 4, 4, 4, 5, 6, 7, 8, 8... 


(There does not need to be a nice rule involved: it may be impossible 
to find a formula to describe a, in terms of n.) Again, we can have a 
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constant sequence 
Tl, TM, Tt, M, M,..-5 Ty... 

or an alternating (in sign) sequence 

1,-1,1, -1,1, -1,... 
In short, there is no restriction on what the members of a sequence 
{a,} may be, except that they be real numbers. What counts is the 
order in which the members of the sequence are placed. 

Now, some sequences have a rather special property. As we go 


along the sequence, the numbers in the sequence get closer and closer 
to some fixed number. For example, the members of the sequence 


Hidde a 
n iy ae ae ee 


get closer and closer to 0 as n gets larger, whilst the members of the 
sequence 


1 
jr+zteus fat as: 


get closer and closer to 1. And the members of the sequence 
3, 3-1, 3-14, 3-141, 3-1415, 3-14159, 3-141592, 31415926, ... 


get closer and closer to z (though this example is not as good as the 
others, owing to our not giving any general rule for the nth term in the 
sequence). 

If the members of the sequence {a,} get closer and closer to some 
fixed number a, we say that the sequence {a,} tends to the limit a, and 
write 


a,7a aS n->o 


(The arrow means ‘approaches without bound’. So, as n gets larger, 
without bound, so a, gets closer to a without bound.) 
So far, this is all very intuitive. Let us see if we can obtain a precise 
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definition of what it means to say a, >a as n— 00. Well, to say thata, 
gets closer and closer to a, without bound, is to say that the difference 
la, — a| gets closer and closer to 0, without bound. This is the same as 
saying that whenever ¢ is a positive real number, the difference |a, — al 
is eventually less than ¢. This leads to the following definition: 


a,7a as n> 
iff 
(Ve > O)(Ane.V)(Vm 2 n)(a,, — al < €) 


This looks quite complicated. Let us try to analyse it. Consider the 
part 


(dne.V)(Vm 2 n)(la,, — al < ) 


This says that there is an n such that for all m greater than or equal to 
n, the distance from a,, to a is less than «. In other words, there is ann 
such that all terms in the sequence {a,} beyond a, lie within the 
distance ¢ ofa. We can express this concisely by saying that the terms 
in the sequence {a,} are eventually all within the distance ¢ from a. 
Thus, the statement 


(Ve > 0)(Sne.V)(Vm 2 n)(\a,, — al < 8) 


says that for every e>0, the members of the sequence {a,} are 
eventually all within the distance ¢ from a. This is just the formal way 
of saying that a, gets closer and closer to a, without bound. 

Let us consider a numerical example. Consider the sequence {1/n}. 
We ‘know’ that 1/n—-0 as n>. We shall see how the formal 
definition works for this sequence. We must prove that 


“ 


(Ve > O)(AneV)(Vm 2 n( 2 < :) 


(Ve > 0)(Ane WV )(Vm > n( fb -0 


This simplifies at once to 
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To prove that this is a true assertion, let ¢ > 0 be arbitrary. We must 
find an n such that m2n=>1/m<ce. Pick n large enough so that 
n> 1/e. (This uses the Archimedean property of #!) If now m2n, 
then 1/m<1/n<e. In other words, (Vm 2 n)(1/m < 8), as required. 
The point to notice is that our choice of n depended upon the value of 
&. The smaller ¢ is, the greater needs to be our n. 

Another example is the sequence {n/(n + 1)}, ie. 


23 
vy 


als 


pee 


Ne 


We prove that n/(n + 1) lasn— o. Leté > 0 be given. We must find 
asnéW such that for all m 2 n, |m/(m + 1) — 1| < ¢. Pick nso large that 
n> 1/e. Then, for m2 n, 


—1 1 
——— — |} = |_| = —__ < — <-<¢g 
m+1 m+1} m+1mi on 


as required. 


Exercise 3.3 


(1) Formulate both in symbols and in words what it means to say 
that a, a asn>o. 

(2) Prove that (n/n+ 1)?1 as n> oo. 

(3) Prove that 1/n?>0 as n> 00. 

(4) Prove that (1/2)"—0 as n> oo. 

(5) We say a sequence {a,} tends to infinity if, as n increases, a, 
increases without bound. For instance, the sequence {n} tends to 
infinity, as does the sequence {2”}. Formulate a precise definition 
of this notion, and prove that both of these examples fulfill the 
definition. 

(6) Let {a,} be an increasing sequence (i.e. a, <a, , for each n). 
Suppose that a, >a as n— 00. Prove that a =Lu.b.{a,, a), a3, ...}. 


CHAPTER 4 


Complex numbers 


4.1 Number systems 


In this book we do not propose to make a detailed study of number 
systems. However, in order to motivate to some extent the in- 
troduction of complex numbers, we consider briefly the development 
of the real number system. 

The simplest number system of all is the natural number system: 


HN =0,2,3,0.3 


with the usual operations of addition and multiplication. (Usually, 
when we speak of a number system, we mean the numbers concerned 
together with the operations of addition and multiplication on those 
numbers.) Though perfectly adequate as a system for ‘counting’ 
elements of finite sets, the system is mathematically inadequate. 
Equations such as 


x+3=2 


have no solution in the system (though all the constants in the 
equation do lie in the system). This particular deficit is overcome by 
extending -V to a larger system, the integers: 


¥ ={0,+1,+2,+3,..3 


Given the natural number system, the system of integers may be 
defined, in a precise mathematical manner, from it: that is, we may 
construct the set & and define the operations of addition and 
multiplication on this set, assuming only the existence of the natural 
number system (together with a few very simple set-theoretic 
operations). (Moreover, we can prove that these operations have all 
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the familiar properties: commutativity, associativity, etc.) Though 
easy, the details of this procedure are not within our present scope. 
Now, in &, all equations 


xta=b (a, be#) 


have solutions, but Z is still mathematically deficient. For instance, 
the equation 


3x =2 


has no solution in 2. We overcome this problem by extending % to 
the system 2 of rational numbers. This involves constructing the set 2 
and defining the operations of addition and multiplication on this set 
(as well as proving that these operations have all the usual 
properties). Again we do not go into the (easy) details. 

In 2, all equations 


ax =b (a, be 2) 


have solutions (except when a = 0, but then there is no equation!). 
Nevertheless, 2 is mathematically deficient, since, for example, the 
equation 
x7=2 

has no solution in 2. To overcome this difficulty we extend 2 to the 
real numbers, &. Again, this involves constructing the set &, defining 
the operations of addition and multiplication on &, and proving that 
these operations behave as we want them to. However, this time it is 
not so easy. We should not expect it to be. For, whereas we can easily 
‘imagine’ what is meant by a ‘negative integer’ (given that we know 
what a natural number is), and what is meant by a ‘rational number’, 
the concept ofan irrational real number defies the (finite) imagination. 
(We can evaluate numbers to 10 decimal places, even 50 or 100; and 
we can easily conceive of evaluating a number to any finite number of 
decimal places; but we can quite literally never evaluate a number to 
infinitely many places. In introducing irrational numbers, the notion 
of the ‘infinite’ first plays a significant role. And we can never truly 
picture any infinite quantity.) So, whereas we may feel quite happy 
with the extensions of V to Z and of & to 2, the step from 2 to Z is 
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quite unnerving. The best ‘picture’ we have is that of ‘filling in the 
holes’ on the rational line. However, as we showed in Chapter 3, 
Section 3.1, in a sense there are no ‘holes’ in 2. (More precisely, 2 has 
no finite holes, only infinitely small (?) ones.) So, for the extension 
from 2 to & we need to rely entirely on our mathematics to keep us 
out of danger. And the mathematics required is quite considerable. 
Indeed, it was not until the latter part of the 19th century that the 
mathematicians Cantor and Dedekind (independently) first managed 
to provide a rigorous construction of @ from 2. 

With the real numbers, we have our first system of numbers which 
allows us to develop some really meaningful and useful mathematics. 
For instance, in @ we can develop the calculus, with its many 
applications in physics and engineering. Or we can extend our theory 
of geometry by introducing the notion of systems of coordinates. 
And so on. 

However, strong through the system & is, it is still mathematically 
deficient. For instance, in # the equation 


x?+1=0 


has no solution. To overcome this problem, we extend & to a larger 
number system, the complex numbers. We describe the extension 
procedure below. 


4.2 Complex numbers 


Our aim is to construct a system of numbers which extends @ and 
allows the solution of, in particular, the above equation. 
That is, we need to do three things: 


(a) Construct the set of new numbers. 
(b) Define the operations of addition and multiplication on this set. 
(c) Prove that everything behaves as it should. 


There are various ways to do this. We describe the most common way, 
but the precise way chosen does not matter. Just ‘what’ a complex 
number will be is not important. It is the behaviour of the entire 
system that counts. This point often escapes the beginner. He/she is so 
used to using real numbers, that he/she ascribes to them a sort of 
concrete ‘existence’, forgetting that they are nothing more than 
figments of the (mathematical) imagination. Thus the (totally mis- 
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leading) idea that real numbers are somehow more ‘real’ than 
complex numbers. (The appalling use of the words ‘real’ and 
‘imaginary’ does not help matters. No number is ‘real’; they are all 
‘imaginary’—in the everyday meaning of these words!) Indeed, the 
really big leap comes in the passage from 2 to &; the subsequent 
passage from & to @ is no worse than that from & to 2 (and in fact is 
mathematically very similar). So whereas it is forgivable to regard 
rational numbers as ‘concrete objects’ (‘forgivable’, note, not ‘cor- 
rect’), it is a crime of the highest order to regard all real numbers as 
concrete objects. That this crime is so widespread exemplifies the old 
saying ‘familiarity breeds contempt’. 

So, having (we hope) disposed of any feelings that we are about to 
do something new/bold/foolish/avant-grade/miraculous/deep, let us 
describe briefly the construction of the complex number system. 

The system & is given. Hence we may construct the Euclidean 
plane Z?. More formally, Z? is the Cartesian product 2 x A, the set of 
all ordered pairs (x,y) where x and y are real numbers, the pair (x, y) 
being the formal counterpart to the ‘point’ in the Euclidean plane 
whose coordinates are x and y. 

Our first definition is:a complex number is an element of 2”. So now 
we know what a complex number is: it is nothing more nor less than 
an ordered pair of real numbers. (Very likely you have by now 
forgotten the remarks made just a few paragraphs ago, and have 
regressed into regarding real numbers as somehow ‘real’. In which 
case our last definition will strike you as odd. All we can say is that if 
we were to explain what a real number actually IS, you would 
probably wish you had decided to study music or art or ancient 
languages or whatever. In comparison, our definition of a complex 
number is very ordinary.) 

Our next task is to define the operations of addition and multipli- 
cation of complex numbers. 

If (a, b), (c, d) are complex numbers, we define the sum 


(a, b) + (c, d) 
to be the complex number 
(a+c,b+d) 


It is easy to prove that addition of complex numbers, so defined, is 
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both commutative and associative. Moreover, the complex number 
(0,0) acts as an additive identity [i.e. (a, b)+(0,0)=@, b) for all 
complex numbers (a, b)], and the complex number (—a, — 5) is the 
additive inverse of (a, b) [i-e. (a, b) + (—a, — b) = (0, 0)]. 

We do not need to define subtraction as such. Writing — (a, b) for 
the additive inverse of (a, b) [namely (—a, — b)], we ‘define’ sub- 
traction by: 

(a, b) —(c, d) =(a, b)+[ —(c, d)] 


Thus, 
(a, b) —(c, d)=(a-—c, b—d) 
The product of the complex numbers (a, b), (c, d) is defined thus: 
(a, b)(c, d) = (ac — bd, ad + bc) 


The reason for choosing this odd looking definition [rather than, say, 
the ‘obvious’ definition (a, b)(c, d) = (ac, bd)] will become clear later. 
But let us remark that we are entirely free to define multiplication 
however we wish. However, unless we define it as above, we will not be 
defining the system of complex numbers, but rather some other 
system. ‘So what is so special about our chosen definition?’, you ask. 
Well, the proof of the pudding lies in the eating: with the definition 
given, we obtain precisely the number system we want! The many 
applications of complex numbers both within mathematics and 
without testify to the wisdom of our definition. (The beginner is often 
surprised to hear that complex numbers are really ‘useful’. And yet 
he/she accepts that irrational real numbers are ‘useful’, even though 
we can never measure any actual quantity to more than a few decimal 
places (and certainly not to infinitely many places!). Once more the 
reason for the discrepancy lies in the familiarity with the one system 
and the strangeness of the other.) 

It is a routine matter to verify that multiplication of complex 
numbers is commutative and associative, and that the distributive law 
holds for multiplication and addition [ie. 2(u+v)=zu+4 zu, for 
complex numbers z, u, v]. Moreover, the complex number (1,0) acts as 
a multiplicative identity, and (a, b) (0, 0)=(0, 0) for all complex 
numbers (a, b). We prove later that division of complex numbers is 
always possible (except for division by zero, i.e. (0, 0)). 
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4.3 The complex plane 


Having now defined the complex number system, can we obtain some 
sort of intuitive picture of the system? For instance, although (as you 
now know!) the real number system is a highly abstract and technical 
system, we nonetheless have the comforting picture of the real line, a 
continuous straight line stretching out to infinity in both directions. 
Well, in fact whereas we need a considerable amount of effort in order 
to relate this picture to the (complicated) mathematical system that is 
the real numbers, our definition of the complex numbers itself 
provides a visual interpretation: the two dimensional plane #7). 
Instead of a straight line (for 2) we have a plane, stretching out to 
infinity in all directions. When we think of the plane #? in this 
manner, we refer to it as the complex plane. (Thus we have a real line 
and a complex plane.) 

One point to notice is that whereas the real numbers have a natural 
ordering (they lie on a line!), the same is not true of the complex 
numbers. There is no natural notion of ‘less than’ for complex 
numbers. 

If(a, b)is acomplex number, we refer to the real number a as the real 
part of (a,b), and to the real number b as the imaginary part of (a,b). The 
use of these words is not intended to convey any deep meaning, but is 
due to the way complex numbers were developed historically. The 


Im 


Figure 4.1 The complex plane 
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real part of a complex number z is denoted by Re(z), the imaginary 
part by Im(z). Notice that both Re(z) and Im(z), are real numbers. 
Thus, in the complex plane, the x-axis is referred to as the real axis, the 
y-axis being the imaginary axis (Fig. 4.1). 

Having now defined the complex numbers, in what sense is this 
system an extension of the reals? According to our definition, no real 
number is a complex number (since all complex numbers are ordered 
pairs of real numbers). However, as we mentioned earlier, with 
number systems it does not matter exactly what a number is, it is the 
behaviour of the entire system that counts. Consider now the complex 
numbers of the form 


(x, 0) 
If we add two of these we obtain another such: 
(x, 0) + (y, 0) = (x + y, 0) 
And multiplication is also simple in this case: 
(x, O)(y, 0) = (xy, 0) 


Indeed, as we can now see, the operations of complex addition and 
complex multiplication on the complex numbers of the form (x, 0) 
reduce to the operations of real addition and real multiplication on 
the first coordinates. The second coordinate—0—plays no role at all. 
Hence, the subsystem of the complex numbers consisting of the 
numbers of the form (x, 0) is just a thinly disguised version of the real 
number system. (In formal mathematical jargon, we say that the 
above subsystem of @ is isomorphic to the system &.) So, in view of our 
earlier remark, when we consider complex numbers, we may as well 
regard a complex number (x, 0) as the same as the real number x. That 
is, there is no point in distinguishing between (x, 0) and x: just write x 
instead. In this way we obtain the set-theoretic inclusion 


RoE 


And in the sense of complex number theory, a ‘real number’ is just a 
complex number whose imaginary part is zero. Pictorally, in the 
complex plane, the ‘real line’ is just the real axis (‘Re’ in Fig. 4.1.) 
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Notice that as a result of our above discussion, if x is a ‘real number’ 
and (a, b)e@, then the product x(a, b) is the complex number 
(xa, xb) (i.e. x(a,b) = (x, 0)(a, b) = (xa — 0b, xb — 0a) = (xa, xb)). 


4.4. The complex number i 


We have already investigated complex numbers of the form (x, 0). 
What about those of the form (0, y)? Such a number is said to be a 
(pure) imaginary number. (Unfortunately we are stuck with this name, 
misleading though it is!) Any such number can be expressed as a 
product ofa real number and the particular imaginary number (0, 1), 
namely 


(0, y) = y(0, 1) 


(This is also true for any fixed imaginary number (0, k) in place of 
(0, 1), of course, provided k # 0, but it will be seen that the choice of 
k = 1 is most natural.) The imaginary number (0, 1) has a remarkable 
property. Squaring gives 


(0, 1)? =(0, 1)(0, 1) = (0.0 — 1.1, 0.1 + 1.0) = (— 1, 0) 
ie. 
(0, 1)? = —-1 


Thus, (0, 1) is a square root of — 1! 

Denoting the number (0, 1) by i, we can now obtain a convenient 
alternative notation for complex numbers. For if (x, y) is any complex 
number, then 


(x, y) = (x, 0) + (0, y) =(x, 0) + y(0, 1)=x+yi 


But be very careful when using this notation. The symbol + refers 
throughout to complex addition. There is no intention that the real 
numbers x and y should be added (in the sense of 2) here. And the 
symbol i is just a convenient abbreviation for the complex number 
(0, 1). 

With this notation, complex multipication is easily appreciated. 
For we now see that it is essentially a result of ordinary algebra. Given 
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complex numbers a + bi and c + di, we have, by algebra 
(a + bil(c + di) =a(c + di) + bi(c + di) 
=ac +adi + bei + bdi* 
But i* = — 1. Hence 
(a + bil(c + di) =ac +. adi + bci — bd 
= (ac — bd) + (ad + be) 


Translation of this equality back into the ordered pair notation yields 
our original definition 


(a, b)(c, d) = (ac — bd, ad + bc) 

When we deal with complex numbers, we usually adopt the notation 

at bi 
Notice that if z=a+ bi, then a = Re(z) and b = Im(z). Also, 

(at bi)+(c+di)=+c)+(b+d)i 
And if z, w are complex numbers, than 
z=w iff [Re(z)= Re(w) and Im(z) = Im(w)] 
This last fact is extremely useful in calculations with complex 
numbers. For example, suppose we wish to find the square roots of 
the complex number 3 + 4i. We begin by letting x + yi be a square 
root of 3 + 41. Thus 
(x + yi)? =3 + 4i 

Expanding the left hand side we get 


x? + Axyit y7i? =3 + 4i 
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Noting that i? = — 1, this becomes 
(x? — y?) + 2xyi=3 4+ 4i 
Equating real and imaginary parts we obtain the equations 


x*-y?=3 
2xy=4 


Eliminating y we get 


Le. 
x* — 3x?-4=0 
This factorises to give 
(x? — 4)(x? + 1)=0 
Since x is real, x? + 1 #0. Hence 
x?-4=0 

Thus x = + 2. So, using the equation 

y=2/x 


we obtain the solutions x = 2, y= 1 and x = —2, y = — 1. Thus the 
required square roots are 


+(2+1) 
This answer may be checked by direct evaluation: 


(24+i1? =44+41+i7?=44+41-1=344i 
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Exercise 4.1 
(1) Evaluate 


(a) (2+3i)+(6 + 5i) 
(b) 3(1 +i) — 49 —2i) 
(c) (3+ 19 -6i) 

(d) (1 +i)i 

(e) i® 

(f) i 

(g) (1-i)? 


(2) Find the complex square roots of 


(a) i 
(b) —3+4i 


(c) 1+(4/3)i 


(3) Which complex number is both real and imaginary? 
(4) On the complex plane, plot the numbers 1, — 1, i, —i and the 
square roots of i (from question (2)(a)). 


4.5 Conjugate complex numbers. Division of complex numbers 


If z=a-+ bi is a complex number, the complex conjugate of z is the 
complex number z =a — bi. The relationship between z and 2 is 
illustrated in Fig. 4.2 overleaf. 

Our main reason for introducing the complex conjugate lies in the 
fact that it enables us to show that division by complex numbers is 
possible. For let z =a -+ bi be any non-zero complex number. Then 


2z = (a + bila — bi) = a* — abi + abi — b?i? =a? + b? 


Ti 
| aap? | =a 


In other words, the multiplicative inverse of z exists and is Z/(a? + b?). 
This enables us to divide by (non-zero) complex numbers. For 
example, suppose we wish to divide 1+i by 3+4i. Then we 
commence by multiplying both numerator and denominator by 


which is real. Hence 
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Figure 4.2 Complex conjugates 


3+41: thus 


+i  1+i 3-41 (1+iG-4i)) 3-41+31-47 
344 34413-41 (3441-41) 9— 1214 121-161? 
ons es Ns a | 


0416 95 35-35" 


We may check this by multiplying the answer by 3 + 4i: 


“ia Wage me eae 
A-pi)-B-wt BS 
Bie res 
735° 357 35" 
ee 


i2 


(3+ si( 


Exercise 4.2 
(1) Evaluate 
(a) 1/1 +1) 
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(b) 13-3) 

(c) (+i —1) 
(d) 6/5 + 2i) 

(e) IA 

(f)  if(2 + Si) 

(g) (1+ 3i)2 -i)? 


= 


(2) Prove that a complex number z is real iff z = z. 
(3) Prove that 1/z = 1/2 for any non-zero complex number z. 
(4) Prove that zw = zw for any complex numbers z, w. 
(5) Prove that 7 w =2z-+ w for any complex numbers z, w. 
(6) What is z? 
(7) Prove that Re(z) =(z + z)/2 and Im(z) = (z — 2)/2i. 
(8) Prove that zw + Zw is always real (hint: prove that zw + Zw = 
2Re(zw)). 
(9) Prove that zw — zw is always imaginary. 
(10) Define f: @-@ by f(z) =z. Is fa bijection? If so, what is f ~ 1? 


4.6 Polar representation of complex numbers 


Since complex numbers are essentially just the points in the complex 
plane, any coordinate system for Z? will suffice to determine complex 
numbers. Using polar coordinates we are able to obtain a repre- 
sentation of complex numbers which is particularly well suited to 


Im 


Figure 4.3 Polar representation of complex numbers 
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complex multiplication (which hitherto has appeared ‘complex’ in the 
everyday sense of this word!) 

In order to specify the complex number z = x + yi(Fig. 4.3) we need 
only give the distance r from the point z to the origin and the angle 0 
made by the line 0z and the real axis. We call (r, 0) the polar coordinates 
of the point z in the complex plane. Here @ is measured in an 
anticlockwise direction from the real axis, and is subject to the 
constraints 


0<0<2n 
(It is usual to express @ in radians in this context.) The number r will be 
a positive real (or zero), but is otherwise unrestricted. 


The relationship between the polar coordinates and the usual 
coordinates is expressed by the equations 


x = Re(z)=rcos@ 
y = Im(z) = rsin 0 


Thus 
z=x+iy=r(cos@+isin 6) 


We usually use cis @ to abbreviate the expression cos @ + isin 0. 
Thus 
z=rcis0 


This is the polar representation of z. 0 is the argument of z, arg(z). ris the 
modulus of z, |z|. Notice that by Pythagoras’s theorem, 


lz] = /(x? + y?) 
Hence, if z is real, |z| is just the ‘usual’ modulus or absolute value of z in 


the sense of &. 


Exercise 4.3 
(1) Express the following complex numbers in polar form: 


(a) 1 
(b) —1 
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(c) i 
(d) —i 


(e) 1+i 
(f) 3441 
(g) 5—12i 


(2) Express the following complex numbers in ‘rectangular form’ 
(i.e. aS x + yi): 


(a) 3cis7/4 
(b) 4cisx/6 
(c) cisSz/6 


(3) Prove that if ze@ and xe, then arg(xz)=arg(z) and 
|xz| = |x||zI. 

(4) Prove that |z|? = zz. 

5) Prove that Re(z) < |Re(z)| < |z]. 

6) Prove that Im(z) < |Im(z)| < |z]. 

7) Prove that |zw| = |z||wl. 

8) Give an example to illustrate that |z+w| # |z| + |wl. 

9) Prove that |2| = |z]. 

0) Prove the triangle law: |z + w| < |z| + |w| (hint: write |z + wl? as 
(z+ w)(z+w) and use the results of questions (4), (5), (8) of 
Exercise 4.2, together with the questions (5) and (9) of this 
exercise. 


( 
( 
( 
( 
( 
1 


( 


4.7 Multiplication of complex numbers in polar form 


Let z, w be complex numbers. In polar form, let z=rcis 0, w =scis ¢. 
Then 


zw =(rcos@ + irsin 8)(scos @ + is sin d) 


=rscos@cos ¢ + irscos Osin @ + irssin# cos @ + 
i7rs sin @ sin b 
=rs[(cos Ocos ¢ — sin 0 sin ¢) + i(sin @cos ¢ + cos sin ¢)] 


Remembering now our basic trigonometric identities, this becomes 


zw = rs[cos(6 + ¢) + isin(@ + @)] 
=rscis(0 + ¢) 


Hence, when we multiply two complex numbers in polar form we 
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multiply the moduli and add the arguments. (It may be that the result 
of adding the arguments results in an angle greater than or equal to 
2x, and in this case we can simply subtract 27 from the answer; but 
this is a minor point.) 


Exercise 4.4 


(1) What is the locus of the point z in the complex plane which 
satisfies the equation |z| = 1? (i.e. Describe the set {zé@||z| = 1}.) 

(2) Let w=cis¢. Let U = {ze@||z| = 1]. Show that the following 
defines a function f: U > U: f(z) = wz. Is f bijective? Describe in 
geometric terms the behaviour of the function f 


A particular case of the above multiplication rule occurs when the 
two complex numbers involved are identical. We then get the rule 


(r cis 0)? = r? cis 20 
In particular, if r= 1, we have 
(cis 0)? = cis 20 
This generalises as follows. 


Theorem 4.1 (de Moivre’s theorem) 


For all ne, (cis 0)" = cis nd. 


Proof 
The proof is by induction on n. For n = | there is nothing to prove. We 
assume the result for n and prove it forn + 1. We have, using the result 
for n, 


(cis 0)" * 1 = (cis 6)"(cis 8) = (cis nO)(cis 8) 

= (cos né + isin n6)(cos 6 + isin 0) 

= cos nO cos @ + isin 8 cosné + isinn@ cos 6 
+i? sinn@ sind 

= (cos nOcos 6 — sinn@sin 0) + i(sin 8cos nO 
+ cos Osin n@) 

= cos(n + 8) + isin(@ +n) 

=cos(n + 1)6 + isin(n + 1)0 


The theorem is thus proved by induction. 
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Let us use de Moivre’s theorem to evaluate (1 + i)'°. In polar form, 
1+1 =/2(c0s5 +i sin) 
Thus, by de Moivre’s theorem, 
10 
0=¢/ 2)°( cos” +i sin) 
4 
2 2 
=? (cos( a + =) +isin (> + =) 
= 25 (cos +isin * 


= 2°(0 + il) 
=2°i 


We may also use de Moivre’s theorem in order to evaluate roots. Let 
us, for example, try to find the complex cube roots of 1. Suppose 
z=rcis@ is a complex cube root of 1. Thus, by de Moivre’s 
theorem, 


z=ricis30=1 
Clearly, if two complex numbers are equal, they have the same 


modulus. 
Hence 


ja[=P=|1|=1 


Thus, as r is a positive real, r = 1, and z = cis 6. We must determine 0. 
We do this by equating real and imaginary parts. We have: 


cos3@ + isin30=1+4+0i 
Hence 
cos 36 = 1, sin30 =0 


Now, we shall be interested only in values of 6 between 0 and 22 (ie. 
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0 <6 < 22). Hence we need only look at values of 36 between 0 and 
6z. In this region, the only solutions to the above equations are 


30 =0, 2z, 4x 
Hence 
2n 4n 
0 = 0, a 
3 
For 6 =0 we get the root 
cisO = 1 


For 6 = 27/3 we get the root 


(Notice that these last two roots are complex conjugates.) 
Another example. Let us find the square roots of 1 +i. Let z be a 
square root, and write z as z=rcis@. By de Moivre’s theorem, 


2 =r’cis20=1+i (4.1) 
Equating moduli, 
P=/(1? +1?) =,/2 


Hence r= g/ 2 and z= 4/ 2(cis 6), and equation (4.1) becomes 


1 
cis 20 = —~ 


1, 
cane 


Equating real and imaginary parts, cos 20 = 1 He 2, sin 26 = 1 ey 2. 
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Thus, to obtain 0 < 6 < 27, we have the solutions 


T On 
20 =— — 
i 4 
giving 
on 
d=— — 
: 8 


Hence the roots are o/ 2(cis 2/8) and / 2(cis 92/8). In this case let us 
leave them in this form. The point to notice is that we divided out by 
the modulus before evaluating the argument. This is important, since 
sin and cos only take values between — 1 and + 1. (Failure to divide 
out the modulus will not cause any problems, of course, providing we 
remember to carry it as a coefficient throughout the evaluation. 
However, it is perhaps wiser to adopt the procedure above.) 


Exercise 4.5 


(1) Use de Moivre’s theorem to evaluate the following: 


(a) (1-i)° 
(b) [8+ 8/3)i]” 


(2) Use de Moivre’s theorem to evaluate the cube roots ofiand —i. 
(3) Use de Moivre’s theorem to evaluate the square roots of 8 
+ (8,/3)i. 

(4) Use de Moivre’s theorem to evaluate the complex 6th roots of 729. 
(5) Without actually calculating the roots in an a + bi form, plot on 
the complex plane the square roots of +1, +i, +(1+ i)A/ 2: 

(6) Plot on the complex plane the 6th roots of 1. 

(7) Obtain a formula (in terms of n) for the nth roots of 1. 

(8) Both by direct calculation and by reference to a diagram, prove 
that if z, w are the two non-real cube roots of 1, then z? = w and 
w? =z. 


4.8 Algebraic equations 


It is well known that in &, the quadratic equation 


ax? + bx +c=0 
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has a solution iff b* > 4ac, in which case the roots are given by the 
formula 


_ —b+,/(b? — 4ac) 


x= MG (a # 0) 


Consider now a similar equation in @: 
az’? +bz+c=0 


(where we allow a, b, c to be complex also). The algebra which led to 
the above formula still works. Hence the roots are given by 


—b+,/(b? — 4ac) 


Z= m (a #0) 


But there is no condition restricting the validity of this formula now. 
The equation always has a solution, and the formula always gives the 
root. For example, consider the equation 


2+z+1=0 


Its roots are given by 


—14+/(-41-1) 
21 


Z= 


(You may check this by substituting back in the original equation.) 
Or consider the cubic equation 


223 + 327 +2z74+3=0 
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This factorises as 
(z? + 1)(2z + 3)=0 


Hence the roots are z=i, z= —i, z= — 3/2. 

The point is this. Given a quadratic or cubic equation (or worse), we 
can set about finding its roots in the usual manner, but now we are not 
prevented from proceeding by virtue of needing to form, say, the 
square root of a negative number. (Since of —-r= (fi r)i). Moreover, 
the coefficients may also be complex. 

In fact there is a general result here. Consider any complex 
polynomial equation (with complex coefficients) 


n n-1 = 
AZ" +a,z"" * +...4+4a,-,2+a,=9 


Then this equation has a (complex) root. This basic result is known as 
the fundamental theorem of algebra. It is truly remarkable. For what it 
amounts to is a proof of the fact that our search for a number system 
which is adequate for mathematics is complete. A fact which 
experience would not have led us to expect. For consider once more 
our progress. Starting with we observed that -/ did not allow us to 
solve all equations of the form x +m =n. So we extended .V to &. 
But in Z we could not solve all equations of the form mx = n. So we 
extended # to 2. In 2 we could not solve all equations of the form 
x? = k(k > 0), so we extended 2 to Z. One might by now expect that 
this procedure will continue indefinitely. But no, if we make one more 
step now (essentially introducing just the single new root of the 
equation x” + 1 = 0), then all equations have solutions. In fact much 
more is the case. The study of complex numbers is one of the most 
beautiful and elegant branches of mathematics, the full appreciation 
of which can only be gained by a detailed investigation of the subject. 
This gateway into one of the most fascinating parts of mathematics 
forms a suitable place for us to end our account. 


Exercise 4.6 


(1) Solve the following equations: 


(a) 427+2+9=0 
(b) 23—(5+i)z7 + (6 + Siz -—6i1=0 (Try z=1!) 
(c) z?7—(5—i)z—Si=0 
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(2) Express the following polynomial expressions as a product of 
linear factors: 


(a) 27-1 
(b) 2741 
(c) z*-1 
(d) z*+1 


(3) Consider the polynomial equation 
z"+a,2" 1+... +4,-1z7+a,=0 
wherea,,...,q, are real. Prove that if z is a root of this equation, 
then so is z. 
(4) Consider the equation z” — 2z + 5 = 0. Given that one root of this 
equation is z= 1 + 2i, what is the other root? 
(5) Use de Moivre’s theorem to prove that the equation 


az"+b=0 


has a complex root for any complex values of a, b (a # 0). 


List of symbols 


eS 
3 
> 
S 


Hed 1 < > 


xe NV), etc. 


e2<pwns 


xe NW ), etc. 


~ 


ROSKRL IY. 


latent 


sey 
{x|A(x)} 
{xe X|A(x)} 


N® 


Ni 


f(a) 
|x| 

Ih 
F(X] 
feg 
f =1 
Lu.b. 
ch 
(a, b) 
[a, b] 
[a, b), (a, b] 
{a,} 
a,—a 
(x, y) 
Re(z) 
Im(z) 
i 
x+iy 
Zz 

(r, 0) 
cis 8 
arg(z) 
Iz| 


Index 


Absolute value, 48 

All, 16 

And, 3 

Archimedean property, 58 
Argument, 79 

Axiom, 30 


Bijection, 47 


Cartesian product, 69 
Closed interval, 60 
Codomain, 47 
Complement, 40 
Completeness property, 56 
Complex number, 69 
Complex plane, 71 
Composition function, 51 
Conjugate, 76 
Conjunction, 5 
Connective, 12 
Contradiction, 21 


Disjoint, 39 
Disjunction, 5 
Domain, 47 


Element, 32 
Empty set, 35 
Equivalent, 7 
Euclidean plane, 69 
Exists, 15 


Function, 44, 47 
Fundamental theorem of 
algebra, 86 


Identity function, 49 


If, 10 

Iff, 11 

Imaginary axis, 72 
Imaginary number, 73 
Imaginary part, 71 
Implies, 6 
Induction, 25 
Injection, 46 
Integers, 66 
Intersection, 38 
Interval, 60 

Inverse function, 53 
Invertible function, 52 
Isomorphic, 72 


Least upper bound, 56 
Limit, 63 


Map, mapping, 48 
Member, 32 

Modulus, 48, 79 

de Moivre’s theorem, 81 
de Morgan laws, 41 


Natural number, 66 
Necessary, 11 


Negation, 5 

Not, 4 

Null set, 35 
Number system, 66 
One-one, 46 
Onto, 46 

Open interval, 60 
Or, 4 

Pair, 69 


Polar coordinates, 79 
89 


90 


Polar representation, 79 
Positive statement, 18 
Postulate, 30 

Power set, 36 

Proper subset, 36 
Proof, 20 


Quantifier, 17 


Range, 49 

Rational number, 67 
Real axis, 72 

Real line, 71 

Real number, 56, 67 
Real part, 71 


Sequence, 62 


INDEX 


Set, 32 
Statement, 1 
Subset, 35 
Sufficient, 11 
Surjection, 46 


Transformation, 48 
Triangle inequality, 59, 60 
Truth, 29 

Truth table, 12 


Union, 36 
Universal set, 37 
Upper bound, 56 


Value, 47 
Venn diagram, 38 


